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The Zeitels Universal Modular (UM) Glottiscope System 
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The recently patented UM glottiscope system was conceived from the study of a century of direct laryngoscope 
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system provides the clinician with a versatile laryngoscope that optimally exposes vocal folds for diagnosis a” “ 
instrumental manipulation, regardless of the diversity of human anatomical factors: age, gender, and patholo; 


Design Features (US patent 5,893,830; international patents pending) 


e The distal lumen of the UM glottiscope is a triangular lancet-arch configuration that distracts the false voc! 
cords and conforms to the anterior glottal commissure. 

e Unlike virtually all microscope-compatible tubular laryngoscopes, which widen the proximal aperture tg 
facilitate angulation of hand instruments, the UM glottiscope has bilateral proximal slots that dramatic ` 
improve the positioning of instrumentation. 

e The UM glottiscope is supplied with a variety of examining tubes which accommodate to the spectrum of 
human size and shape, irrespective of gender, age, or disease, and that attach to a single universal handle. 

e The universal, ergodynamically designed titanium handle can be joined with a suspension gallows as well as 
American and European chest-support holders. 

e Thedetachable base-plate is ideally suited for difficult intubations; lighting and suction cannulae are contai 
within the base-plate. 
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the United States must carry the following statement 
on the title page: “This study was performed in ac- 
cordance with the PHS Policy on Humane Care and 
Use of Laboratory Animals, the NIH Guide for the 
Care and Use of Laboratory Animals,and the Animal 
Welfare Act (7 U.S.C. et seq.); the animal use proto- 
col was approved by the Institutional Animal Care 
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are listed after the abstract. 
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Primary references should be used whenever possi- 
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over 30. Use the style of references given in the 
Uniform Requirements or a current issue of the An- 
nals. Include the names of all authors and the inclu- 
sive page numbers of an article. If a manuscript 
accepted but not yet published is included in the 
reference list, give the accepting journal’s name, 
followed by “in press.” Manuscripts still in review or 
not yet accepted formally should be cited within the 
text as “unpublished observations.” A reference to a 
personal communication is also placed in the text, 
accompanied by a date (year). Papers presented at 
scholarly meetings but not published are considered 
“unpublished observations.” Papers published only 
in abstract form are listed as references with “[Ab- 
stract]” after the title. 


Tables should be on separate sheets, numbered con- 
secutively and headed by a concise title. Put explana- 
tory matter in footnotes. Tables are adjuncts to the 
text and should not repeat material presented therein. 
The cost of typesetting tables is billed to the author. 


Illustrations must be submitted in three complete 
sets, unmounted. Only professional-quality glossy 
photographs and black-and-white line drawings are 
acceptable. Computer-generated graphs are accept- 


_ able only if they are clear and will reproduce well. 
| Letters, numbers, and symbols should be clear and of 


uniform size throughout, large enough that each item 
is legible when reduced for publication. Put legends 
(detailed explanations) to the photographs on a sepa- 
rate page in the manuscript. Affix a label to the re- 
verse side of each illustration, indicating figure num- 
ber, first author’s name, and top of the figure. Give 
indications for maximum cropping and labeling on 
n overlay or photocopy. Cite each figure in the text 
nconsecutive order. Written permission from identifi- 
L subjects is required. The cost of preparing illus- 
ations for publication (sizing to fit journal, and 
reparation of negatives for printing) is charged to 
the author without exception. Color illustrations are 
accepted; cost estimates for color separations and 
printing are provided on an individual basis. Illustra- 
tions should enhance, not repeat, material presented 
in the manuscript and should be kept to a minimum. 


Rates are quoted when galley proofs are sent to the 
author. Orders are signed by the author and returned 
with the proofs. 
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following criteria for authorship and am qualified to 
be listed as an author of this work by my substantive 
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DISTAL RENAL TUBULAR ACIDOSIS ASSOCIATED WITH ISOLATED 
LARGE VESTIBULAR AQUEDUCT AND SENSORINEURAL HEARING 
LOSS 


STEFANO BERRETTINI, MD 
FRANCESCA RAVECCA, MD 


FRANCESCA FORLI, MD STEFANO SELLARI FRANCESCHINI, MD 
- MAURO MASSIMETTI, MD EMANUELE NERI, MD 
PISA, ITALY 


Distal renal tubular acidosis (ARTA) is characterized by a defect in urinary acidification with various degrees of metabolic 
acidosis; it can be inherited either as an autosomal dominant trait or as a recessive trait. The recessive form is associated in about one 
third of cases with progressive sensorineural hearing loss (SNHL). We performed a neuroradiological study in 3 consecutive unre- 
lated pediatric patients affected with sporadic dRTA and progressive SNHL that disclosed an enlarged vestibular aqueduct (VA) and 
endolymphatic sac (ES) in each. The presence of an enlarged VA in our patients with dRTA and SNHL could contribute to the 
development, or at least the progression, of the hearing impairment. We suppose that the same molecular defect present in both the 































INTRODUCTION 


Distal renal tubular acidosis (ARTA) is a disorder 
characterized by a defect in urinary acidification ac- 
companied by various degrees of metabolic acidosis 
in the absence of aminoaciduria or glycosuria. The 
reduced ability to acidify urine despite systemic meta- 
bolic acidosis or acid loading is due primarily to a 
defect in hydrogen ion secretion associated, at least 
in part, with impaired ammonium excretion by the 
distal nephron. Other metabolic alterations include 
hypokalemia due to renal potassium wasting, which 
is generally underestimated because of the systemic 
acidosis, hyperchloremia, and elevated urinary calci- 
um.!2 


In its primary form, hypokalemic dRTA usually 
occurs in young children, who, because they never 
achieve a normal acid-base balance, typically present 
with severe metabolic acidosis, growth retardation, 
rickets, and hypercalciuria with associated nephro- 
calcinosis and nephrolithiasis. The spectrum of clini- 
cal severity is very wide.!.2 Early appropriate alkali 
therapy can correct the metabolic acidosis and pre- 
vent or ameliorate the clinical consequences of the 
impaired tubular function.!.2 


Familial dRTA holds particular interest for otolar- 
. yngologists, because it has been described in associ- 
gion with sensorineural deafness.2~4 Both autosomal 
ginant and autosomal recessive inheritance of pri- 
dRTA has been observed, and the genetic de- 


kidney and the inner ear could be the cause of dRTA and of the development of the enlarged VA and ES. 
KEY WORDS — distal renal tubular acidosis, large vestibular aqueduct, sensorineural hearing loss. 


fect and severity of clinical manifestations seem to 
vary in relation to the inheritance pattern. 


The dominant form is generally milder, without 
nephrocalcinosis, and has onset at various ages; it 
sometimes is not diagnosed until adulthood.*> The 
recessive form of the disease, on the other hand, is a.. 
more severe disorder that usually presents in early . 
childhood with acute illness or failure to thrive, fe- 
ver, dehydration, nausea, vomiting, polyuria, polydip- 
sia, and early nephrocalcinosis.!2 In patients with 
recessive disease, a downsloping sensorineural hear- 
ing loss (SNHL) may be present (about one third of 
cases), with widely variable severity (from mild to 
profound) and progressive deterioration. 


A locus, ATP6B1, has been identified as respon- 
sible for recessive dRTA associated with SNHL.4 Ac- 
cording to these reports, the association of deafness 
with recessive dRTA could be explained by this mo- 
lecular disorder’s causing a defect in urine and en- 
dolymph acidification.* To date, no morphological 
study of the inner ear has been reported that includes 
high-resolution (HR) magnetic resonance imaging 
(MRI) and HR spiral computed tomography (CT) in 
patients with dRTA and SNHL, nor does the litera- 
ture contain case reports of recessive RTA and SNHL 
associated with congenital inner ear malformations. 


The clinical and audiological findings in 3 consec- 
utive pediatric patients affected with dRTA and pro- 
gressive SNHL prompted us to consider the coexist- 
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Fig 1. (Case 1) Pure tone audiograms. A) Performed at age 7 years. Shows profound downsloping sensorineural hearing loss 
(SNHL) in right ear. B) Performed at 16 years of age. Shows SNHL with flat pattern of moderate severity in left ear and right 


profound SNHL in right ear. 


ence of large vestibular aqueduct syndrome (LVAS). 
We therefore performed inner ear HR-MRI in each, 
and in the elder patient, temporal bone HR-CT, as 
well. 


MATERIALS AND METHODS 


A morphological study of the inner ear was per- 
formed on 3 consecutive unrelated pediatric patients 
affected with dRTA and progressive SNHL exam- 
ined at the ENT and Pediatric Clinics of the Univer- 
sity of Pisa. The diagnosis of dRTA was made in the 
3 patients at the ages of 5 weeks (case 1), 7 weeks 
(case 2), and 5 months (case 3) on the basis of clini- 
cal presentation (symptoms and laboratory findings) 
and response to alkali therapy. ! 


After otomicroscopic examination, a complete au- 
diological evaluation was performed on each patient. 
The tests included pure tone audiometry (0.5, 1, 2, 
4, and 8 kHz) for patients more than 6 years of age 
and conditioned oriented responses for patients be- 
tween | and 6 years of age; impedance audiometry; 
the stapedius reflex threshold test; and brain stem 
auditory-evoked potential (BAEP) testing. The hear- 
ing loss was considered progressive when a mean 
worsening of 20 dB or more was recorded on at least 
2 frequencies in the range of 0.25 to 4 kHz.®° We per- 
formed a study of the vestibular function only in the 
eldest patient. 


We also performed HR-MRI at a high field strength 
(Signa 1.5 Tesla; GE/Medical Systems, Milwaukee, 
Wisconsin) in each of the 3 patients and HR spiral 
CT of the temporal bone in the eldest patient alone 
(case 1). The HR-MRI was performed by means of a 
dedicated surface coil for signal reception and 3-di- 
mensional acquisition. Three-dimensional recon- 
structions of the endolymphatic duct and sac were 
also generated on a dedicated workstation (Ultra- 
Sparc 1; Sun Microsystems, Mountain View, Cali- 
fornia). The HR spiral CT was performed with a com- 
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bined acquisition in axial and coronal planes.’ A ves- 
tibular duct was considered to be enlarged if its cali- 
ber was greater than 1.5 mm at the midpoint between 
the common crus and the external aperture.®.? 


CASE REPORTS 


Case 1. A boy presented at 5 weeks of age with 
failure to gain weight (mean weight gain, 60 g/wk), 
generalized hypotonia, and severe metabolic acido- 
sis: pH 7.06, PO2 64 mm Hg, PCO2 30 mm Hg, bicar- 
bonate 16 mmol/L, base excess —8. Other relevant 
findings were hypokalemia (2.5 mEq/L), hyperchlo- 
remia (104 mEq/L), and hypercalciuria (7 mg/kg per 
day) with urinary pH 7.2 and radiographic evidence 
of nephrocalcinosis in both kidneys. A sporadic form 
of dRTA was diagnosed. The boy was started on a 
systemic alkali therapy (oral sodium bicarbonate and 
potassium chloride supplementation) that corrected 
the acid-base status and serum electrolyte profile, 
thereby establishing normal growth. At age 7 years, 
a profound right-sided, downsloping SNHL was di- 
agnosed (Fig 1A). Tympanometric findings were nor- 
mal bilaterally; the stapedius reflex was present on 
the right but absent on the left. Testing of BAEPs 
confirmed normal left-sided hearing and a >90 dB 
hearing level (HL) hearing loss in the range of 2 to 4 
kHz on the right. 


Between the ages of 7 and 12 years, the boy com- 
plained of fluctuations of his hearing on the left side. 
Then, when he was 12 years old, progression of the 
right-side hearing loss and a mild sensorineural defi, 
cit at 2 and 4 kHz on the left side were detected 


Over the following years, the hearing loss q 
right side remained stable, whereas the heariy 
on the left worsened. At age 16 years, a p 
audiogram revealed a flat pattern on the left ¢ 
a 45-dB HL threshold between 0.25 and 4 
profound right-sided SNHL (Fig 1B and Ta 
hearing aid was fitted on the left side. Vesti 
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TABLE 1. AUDIOLOGICAL FEATURES OF THREE PATIENTS REPORTED 








Brain Stem 





Reason for Pure Tone Type of Auditory 
Referral to Average (0.5, Hearing Vestibular Evoked Therapy for 
Case Sex Age(y) Neuroradiology 1,2 kHz) Loss Complaints Potentials Hearing loss 
l M 16 Hearing loss Right profound hearing Progressive None Normal central None (only 
loss (103 dB HL); left conduction on counseling) 
moderate (48 dB HL) both sides 
2 M 8 Hearing loss Right moderate hearing Progressive None Normal central None (only 
loss (55 dB HL) conduction on counseling) 
both sides 
3 F 4 Hearing loss Bilateral profound hearing Progressive None Normal central None (only 


loss (conditioned oriented 


responses; 98 dB HL) 
HL — hearing level. 


conduction on 
both sides 


counseling) 





amination showed normal vestibular function on both 
sides. 


No consanguinity was revealed in the patient’s 
family. Pure tone audiometry was performed on his 
parents and brother (18 years old) and yielded nor- 
mal results in each. The patient was subjected to pe- 
trous bone CT and inner ear MRI, which disclosed a 
large vestibular aqueduct (LVA) on the right and a 
large endolymphatic sac (LES) without any other in- 
ner ear anomalies (Fig 2 and Table 2). 


Case 2. A boy was hospitalized at 7 weeks of life, 
after an ambulatory sonography examination of the 
kidneys (during a screening program for the detec- 
tion of malformation uropathies) yielded results com- 
patible with bilateral nephrocalcinosis. He presented 
a mild degree of metabolic acidosis with pH 7.28, 
Po2 94 mm Hg, PCO2 23.7 mm Hg, bicarbonate 11.4 
mmol/L, base excess —13.5, and anion gap 21.6. The 
urinary pH was 6.8, calciuria 6.28 mg/kg per day, 
and urinary calcium/urinary creatinine 0.53 (both ele- 





Fig 2. (Case 1) High-resolution computed tomographic 
scan of temporal bone demonstrates enlargement of right 
vestibular aqueduct (arrow; maximum diameter, 2.8 mm). 


vated) with normokalemia and hyperchloremia (107 
mEq/L). Sporadic dRTA was diagnosed. Systemic al- 
kali medical treatment with Shohl’s solution was in- 
stituted and successfully corrected the acid-base sta- 
tus and serum electrolyte profile and allowed normal 
growth. 


At the age of 5 years, the boy underwent a routine 
pure tone audiometric test that revealed normal hear- 
ing function bilaterally. When he was 6 years old, his 
mother reportedly suspected a hearing loss on the 
right side, and at the age of 8 years pure tone audi- 
ometry revealed a moderate to severe hearing defi- 
cit at low and medium frequencies and a severe to 
profound deficit at high frequencies on the right side 
and normal hearing on the left (Fig 3 and Table 1). A 
BAEP study showed normal hearing on the left and 
a 70—-dB HL hearing loss at 2 to 4 kHz on the right. 
Tympanometric findings were normal on both sides, 
and the stapedius reflex was present on the right and 
absent on the left. Probably, there is some kind of 
consanguinity between the patient’s parents; pure tone 
audiometry yielded results within the normal range 
for both. An HR-MRI study was performed on the 
patient and revealed a right LVA and LES without 
any other inner ear anomalies (Fig 4 and Table 2). 


Case 3. At 5 months of age, a girl was admitted to 
the hospital because of poor growth and weakness. 
Laboratory analyses revealed metabolic acidosis with 
decreased serum potassium and elevated levels of 
chloride. The urine was inappropriately alkaline, 
though protein- and sugar-free; a urinary tract in- 
fection was also present. Bilateral nephrocalcinosis 
was detected by a sonographic examination. A spo- 
radic form of dRTA was diagnosed, so systemic al- 
kali medical therapy was instituted (oral sodium bi- 
carbonate and potassium chloride supplementation), 
which corrected the acid-base status and serum elec- 
trolyte profile and established normal growth. 


At the age of 13 months, the parents noticed a wor- 
sening of the child’s hearing, and a severe bilateral 
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TABLE 2. RADIOLOGIC INNER EAR FINDINGS OF THREE PATIENTS REPORTED 
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Vestibular Aqueduct Endolymphatic Sac 
Site of Anteroposterior Anteroposterior g Other Inner 
Case Sex Age (y) Involvement Diameter (mm) Volume (cm?) Diameter (mm) Volume (cm?) Ear Anomalies 
1 M 16 Right R28 SROS R3 i R03 None 
L ND L ND L ND L ND 
2 M 8 Right R3 R 0.5 R 4.5 R 0.5 None 
L ND L ND L ND L ND 
3 F 4 Bilateral R 2.1 R 0.1 R ND R 0.1 None 
[2.2 L0.2 1.2.2 L 0.2 


ND — not detectable. _ 








hearing deficit was diagnosed by means of routine tients support the assumption that this disease is trans- 
pure tone audiometry (Fig 5A). Tympanometric re- mitted through an autosomal recessive inheritance 
sults were normal on both sides, and the stapedius pattern.* 


reflex was absent on both sides. Testing of BAEPs 
confirmed a 70—-dB HL hearing loss on the right side 
and a 60-dB HL loss on the left, both in the range of ee X 
7 tò: 4'kHz The child was fitted-with bilateral hear- onset — or, more exactly, of diagnosis — of deaf- 
ing aids and referred for speech therapy. At age 4 ness in dRTA patients varies quite widely: between 

ADOT war A ak x baa 3 months and 17 years, with the highest incidence 
SEATS A WOREDE GETOS DERNE MICENE Wee dt between 2 and 6 years.4!0 In any event, the onset in 
agnosed by audiometric tests (Fig 5B), and BAEP Berea (le ye oe we y RES ; 
testing confirmed a hearing loss at 2 to 4 kHz of 90 many cases occurs before language acquisition, as 
dB HL on the right and 80 dB HL on the left (Table 
1). Inner ear HR-MRI performed at this time dis- 
closed the presence of a bilateral LVA and LES with- 
out any other inner ear anomalies (Fig 6 and Table 
2) 


The clinical features of hearing loss associated 
with dRTA are quite variable. The reported age of 


No consanguinity has been detected between the 
child’s parents. Pure tone audiometry showed nor- 
mal results in each. 


DISCUSSION 

The coexistence of hearing loss with dRTA, first 
noted in 1967 by Royer and Broyer? in 3 of 10 unre- 
lated patients (4 male and 6 female) with dRTA, has 
subsequently been documented by a number of au- 
thors.!.24 The high rate of consanguinity in the fami- 
lies of patients with deafness associated with dRTA 
and the absence of disease in the parents of the pa- 
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Fig 4. (Case 2) A) High-resolution magnetic resonance 
image of right labyrinth demonstrates severe dilatation 
Fig 3. (Case 2) Pure tone audiogram performed at age 8 of vestibular duct (single arrow; maximum diameter, 3 
years. Shows SNHL in right ear, moderate to severe at mm) and sac (double arrows; maximum diameter, 4.5 
low and medium frequencies, and severe to profound at mm). B) Maximum-intensity projection. Double arrows 
high frequencies. indicate enlarged endolymphatic sac. 
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Fig 5. (Case 3) Pure tone audiograms. A) Performed at age 13 months. Shows severe bilateral hearing loss. B) Performed at 


age 4 years. Shows progression of SNHL. 


in case 3 reported here. The SNHL is generally bi- 
lateral, although monolateral and bilateral-asymmet- 
ric cases have been described,’ as in cases 1 and 2 
herein. The degree of hearing may be from mild to 
profound,‘ and the hearing threshold curve is flat or 
slightly downsloping, similar to those of our patients 
2 and 3. 


The progressive course of SNHL, as observed in 
each of the 3 cases reported, has generally been con- 
firmed in the literature. The postverbal or late-child- 
hood onset of deafness in such patients supports this 
progressive nature,* even if detecting the progres- 
sion of hearing loss is difficult in preverbal children 
with severe to profound deafness.° There seems to 
be no correlation between the severity of systemic 
acidosis and early onset or severity of deafness,* and 
alkali replacement therapy does not seem to alter the 
severity or progression of the deafness.* 





Fig 6. (Case 3) High-resolution magnetic resonance im- 
age of left labyrinth demonstrates moderate dilation of 
vestibular duct (single arrow; maximum diameter, 2.2 
mm) associated with some degree of dilation of sac (dou- 
ble arrows). 


The clinical data reported in the literature, together 
with the overall pattern of progressive hearing loss 
in case 1, led us to the realization that the audiologi- 
cal features of dRTA-associated SNHL, in our pa- 
tients, were similar to those of LVAS. This prompted 
us to subject our patients to inner ear MRI, which 
disclosed the presence of an isolated LVA and LES 
in all of those examined. Two of the 6 ears studied 
were morphologically normal (cases | and 2): | had 
normal hearing (case 2), and the other exhibited mild, 
late-onset, slowly progressive SNHL (case 1). The 
other 4 ears, affected with severe to profound SNHL, 
presented clearly widened vestibular aqueducts and 
endolymphatic sacs. 


An enlarged vestibular aqueduct associated with 
SNHL!®:!! is considered to be the most frequent mor- 
phogenetic cause of hearing loss in children, although 
its frequency is often underestimated.®!1-!3 It is often 
associated with other inner ear malformations!!:!47! 
and, according to some authors, it must be consid- 
ered a separate clinical entity only if it is the only 
radiographically detectable inner ear anom- 
aly.8.11,12,15,16.18 Iņ the present study, a concurrent 
LVA and LES were the only radiographic evidence 
of inner ear malformation detected in each of the af- 
fected ears. 


The LVA is bilateral in 55% to94% of cases? 1016.17.19 
and generally is nonfamilial, even if a genetic pre- 
disposition is evident in some patients.'®-'8.202! Tt is 
characterized by fluctuating, progressive SNHL, of- 
ten with sudden, stepwise onset or progression re- 
sulting from activities involving the Valsalva maneu- 
ver, minor head trauma (50% of cases, according to 
Walsh et al!9), etc.9-!116.17.19-21 Some authors report 
a mixed hearing loss in a minority of their patients 
(17% to 38%).9-11.12.17.19.21.22 The SNHL is mild to 
profound and progressive in a widely variable per- 
centage of cases (11% to 60%),?:!1:16.20.21.29 and when 
found bilaterally, it has been reported by many au- 
thors to be asymmetric.'° The onset of SNHL may 
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occur from birth to adolescence, with the highest fre- 
quency in childhood,?:!0:16,20.21 and some LVAS-af- 
fected patients may experience no hearing impair- 
ment. !0,17,22,23 Vestibular symptoms are present in 
fewer than one third of cases, although vestibular 
hypofunction seems to be more frequent.?-!1.19.23 The 
clinical and audiological features of our 3 patients 
are similar to those reported in the literature in pa- 
tients affected with LVAS. 


The diagnosis of LVAS can be made either by tem- 
poral bone CT or by inner ear MRI. Temporal bone 
CT visualizes the bony vestibular aqueduct, while 
inner ear MRI discloses the fluid-filled membranous 
structures of the inner ear, allowing identification of 
the extraosseous portion of the endolymphatic sac. 
For this reason, MRI is considered by many to be 
superior to CT in the evaluation of LVAS.}415,19,24 
Not all authors agree on the criteria for definition of 
an LVA. The most widely accepted criterion is that 
advanced by Valvassori and Clemis®: a vestibular 
aqueduct is to be considered “large” whenever its 
anteroposterior diameter, measured midway between 
the outer aperture and common crus, is equal to or 
greater than 1.5 mm.3.916 


Preventing the development of hearing loss or its 
advancement is difficult, but patients and families 
must be thoroughly informed of the risks and pre- 
ventive measures, the most important of which is to 
avoid the minor head trauma that is so often related 
to hearing loss onset and progression. Counseling 
should include emphatic cautions against contact 
sports and sports engendering large and sudden baro- 
metric pressure changes, such as scuba diving, as well 
as activities involving the Valsalva maneuver, such 
as weightlifting. 


The treatment of LVAS is primarily symptomatic. 
The institution of osmotic therapy may restore or 
ameliorate the hearing loss during the acute stages,® 
and cases of a paradoxical improvement after hyper- 
baric oxygen therapy have been described.”> In LVAS- 
affected children, hearing devices and speech therapy 
must be prescribed as needed. Good results have also 
reportedly been obtained with cochlear implants.2627 
Surgical therapy is controversial: neither the endo- 
lymphatic-subarachnoid nor the endolymphatic-mas- 
toid shunts tried have yielded encouraging results,!!.14 
and the benefits of endolymphatic sac obliteration 
remain to be proved, as well.?.16.21 


To date, Pendred’s syndrome and brachio-oto-re- 
nal syndrome are the only 2 genetic syndromic con- 
ditions reported to be associated with LVA.28.29 
Therefore, on the basis of our findings, RTA com- 
bined with SNHL would represent the third genetic 
syndromic condition described in association with 


LVA. The presence of an LVA in our patients with 
dRTA and progressive SNHL could be one of the 
causes of SNHL and its overall progression. 


A statistically significant association has been dem- 
onstrated between a mutation in the gene encoding 
the B1 subunit of hydrogen adenosine triphosphatase 
(H*-ATPase; locus ATP6B 1, mapped to chromosome 
2) and recessive dRTA accompanied by deafness.4 
Such an association could be explained by the fact 
that the proton pump H*-ATPase containing the B1 
subunit, which is present at high density at the api- 
cal surface of the acid-secreting cells of the distal 
nephron, has been found in the inner ear, as well — 
in particular, in the interdental cell layer of the spiral 
limbus of the cochlear scala media and the epithelial 
cells of the endolymphatic sac. The B1 subunit could 
be involved in the maintenance of endolymphatic pH 
homeostasis, which is essential for the normal audi- 
tory function of the organ of Corti. In fact, a high 
potassium ion concentration in the endolymph of the 
cochlea would be expected to cause a passive hydro- 
gen ion efflux and consequent alkaline pH, as the pH 
of the cochlea endolymph is normally about 7.4 and 
that of the endolymphatic sac is about 6.6.4 This find- 
ing points to the existence in the inner ear of an ac- 
tive acidification process that may depend upon the 
normal functioning of the Ht-ATPase and its B1 sub- 
unit. If the B1 subunit is mutated, the loss of its func- 
tion could produce alkalinization of the endolymph 
with possible consequent impairment of hair cell 
function. This hypothesis would explain why system- 
ic alkali therapy reduces the metabolic alterations and 
ameliorates or eliminates the clinical manifestations 
of dRTA but does not prevent deafness.* 


Moreover, we can suppose that in this kind of pa- 
tient, the same molecular defect is present both in 
the distal nephron, leading to dRTA, and in the inner 
ear epithelia, leading to a hydro-electrolytic imbal- 
ance of the endolymph, which could cause an endo- 
lymphatic hydrops with a widening of the vestibular 
aqueduct and endolymphatic sac during their devel- 
opment. In this regard, recently, it has been supposed 
that the LVA results from an aberrant growth of the 
vestibular aqueduct during fetal life and the first 3 to 
4 years of postnatal life, rather than from an arrest in 
its development or failure of narrowing early in em- 
bryonic life?°; so the development of an LVA could 
be due to a hydro-electrolytic imbalance, such as that 
seen in Pendred’s syndrome and in dRTA, in which 
an H*-ATPase deficit is present in the inner ear epi- 
thelia.* 


The presence of an LVA in our patients affected 
with dRTA and SNHL could contribute to the devel- 
opment, or at least, the progression, of the hearing 


> 
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impairment. In this regard, it is interesting that the 
ears with the earliest-onset and most severe hearing 
impairment presented clearly widened vestibular 
aqueducts and endolymphatic sacs. 


Our aim here has been to highlight the association 
between dRTA and LVA and the importance of evalu- 
ating and following up dRTA patients through audio- 
logical and morphological examinations, including: 


inner ear HR-MRI, in order to disclose the presence 
of an LVA, in the hope of preventing or at least re- 
ducing the progression of hearing loss. 


We think that a better understanding of the hydro- 
electrolytic composition and of the hydro-electrolytic 
imbalance of the endolymph in this kind of patient 
could indicate a new therapy for SNHL in patients 
with dRTA. 
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IS HEARING PRESERVATION USEFUL IN VESTIBULAR 
SCHWANNOMA SURGERY? 
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The goal of our study was to examine the usefulness of preserved hearing after unilateral vestibular schwannoma removal. The 
patients were assessed by clinical evaluation, preoperative and postoperative audiometric evaluation, operative findings, and postop- 
erative functional results. Hearing was preserved in 47 of 119 patients. The postoperative hearing was better than 30 dB in 10 
patients. During the follow-up, the hearing decreased 5 dB on average, and almost all of the decrement occurred during the first 6 
months. Subjectively, the preserved hearing assisted in understanding of speech in 62% of the patients. Tinnitus did not interfere with 
understanding of speech. The age of the patients was the most significant factor associated with preserved hearing. Sixty-six percent 
of the patients with hearing preservation rated their preserved hearing as valuable. Neither tinnitus nor speech distortion reduced the 
appreciation for hearing preservation. We conclude that efforts to preserve hearing are worthwhile. 


KEY WORDS — cochlear nerve blood flow, handicap, hearing preservation, neuromonitoring, risk factor, tinnitus, vestibular 


schwannoma. i 


INTRODUCTION 


The first reports of hearing preservation in vestib- 
ular schwannoma (VS) surgery are from the 1960s.! 
With the development of microsurgical techniques, 
success rates regarding hearing preservation im- 
proved, the best being close to 80%.23 However, ac- 
cording to a recent consensus regarding criteria for 
hearing preservation, the success rate in these series 
should be between 40% and 60%,45 Tumor size 
seems to be a prognostic factor for hearing preserva- 
tion, the success rate being higher with small tu- 
mors.®7 When the extracanalicular tumor size ex- 
ceeds 15 mm, hearing preservation is generally not 
feasible.© The usefulness of preserved hearing has 
been questioned.’ 


The purpose of the present study was, first, to eval- 
uate the usefulness of preserved hearing and the role 
of tinnitus and, second, to evaluate preoperative and 
postoperative factors predicting successful outcome 
in the attempt to preserve hearing. 


SUBJECTS AND METHODS 


Subjects. During the years 1979 to 1994, 365 pa- 
tients were operated on for primary unilateral VS. In 
order to save the patient’s hearing, we preserved the 
cochlear nerve in 119 patients. In the latter group, 
the mean size of the extracanalicular tumors, mea- 
sured in the plane perpendicular to the posterior bor- 


der of the temporal ridge, was 12.5 mm (range, 5 to 
35 mm). Table 1 shows the mean tumor size, age, 
pure tone average (PTA), and speech discrimination 
score (SDS) of all patients. 


TABLE 1. PREOPERATIVE AND POSTOPERATIVE 
DATA OF THOSE NOT SELECTED AND THOSE 
SELECTED FOR HEARING PRESERVATION PROGRAM 
AND THOSE WITH AND WITHOUT SUCCESSFUL 
SURGERY 


Hearing Hearing Hearing 
Preservation Preservation Preservation 











Not Attempted Failed Successful 
(n = 246) (n =72) (n = 47) 

Age (y) 48 45 49.2 
Tumor size (mm) 25.5 15.0 12.6 
Pure tone average i 

before operation 

(dB) ' 72.5 37.1 31.0 
Discrimination 

before operation i 

(%) 42.2 76.2 82.9 
Pure tone average 

after operation 

(dB) >120 >120 54.7 
Discrimination 

after operation 

(%) 0 0 64.6 
Facial function 

(House- 

Brackmann scale) 2.1 ; 1.3 1.1 
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TABLE 2. PREOPERATIVE HEARING CLASSIFICATION 
OF ALL VESTIBULAR SCHWANNOMA PATIENTS 
ACCORDING TO GUIDELINES OF COMMITTEE ON 
HEARING AND EQUILIBRIUM” 











Hearing Hearing . Hearing 
Preservation Preservation Preservation 
Not Attempted Failed Succeeded 
Category (n = 246) (n= 72) (n= 47) 
A(PTA S$ 30dB, 49(12.8dB 39(64dB 27(8.1dB 
SDS 2 70%) and 95%) and 99%) and 99%) 
B (30 dB <PTA< 45 (11 dB 19 (8.1 dB 13 (9.4 dB 
50 dB, SDS2 and 99%) and 99%) and 100%) 
50%) 
C (PTA > 50 dB, 24 (12.4 dB 5 (14 dB 5 (8.8 dB 
SDS 2 50%) and 95%) and 100%) and 100%) 
D (PTA any level, 128 (14.2 dB 9 (8.8 dB 2 (7.5 dB 
SDS < 50%) and 98%) and 97%) and 98%) 


Each field shows number of patients falling into category (A, B, C, 
or D); mean contralateral pure tone average (PTA) and speech dis- 
crimination score (SDS) are shown in parentheses. 





Selection Criteria for Hearing Preservation Pro- 
gram. Tumor size was not a criterion for selecting 
patients for hearing preservation, although tumor size 
has been shown to correlate strongly with hearing 
preservation.® Nor was there a mandatory preopera- 
tive hearing level, another factor that has been shown 


_ to correlate with hearing preservation? (Table 210). 


In this study, all patients with preoperative hearing, 
good or bad, were considered for hearing preserva- 
tion, and the patient was finally included in the group 
for attempted hearing preservation if, in the operation, 
a cleavage plane between the tumor and the nerve 
could be elaborated. !! 


Evaluation of Hearing and Tinnitus. The patients 
operated on between 1979 and 1994 with hearing 
preservation were evaluated. Pure tone audiograms 
were obtained, and speech audiometry was per- 
formed. The patients’ subjective judgment regard- 
ing tinnitus and the usefulness of the residual hear- 
ing in the afflicted ear was elicited with 2 question- 
naires. The tinnitus questionnaire was based on the 
Tinnitus Handicap Questionnaire,!2-!3 and the hear- 
ing questionnaire was based on the Social Hearing 
Handicap Index.!4 


Comments on Surgery. The internal auditory ca- 
nal (AC) was drilled open in a lateral direction, per- 
pendicularly to its posterior lip, with care taken not 
to slant the drill, and a diamond bur was used in ap- 
proaching the lateral parts of the IAC. Since the lat- 
eral parts of the vestibular nerves are rarely free of 
tumor, the most lateral part of the IAC was usually 
exposed. At this stage, the cochlear nerve was still 
protected from surgical injury by the tumor. The tu- 
mor was removed first from the superior vestibular 
nerve above the facial nerve and then from the infe- 
rior vestibular nerve. At this point, care had to be 


TABLE 3. HEARING ONE WEEK AFTER OPERATION IN 
TUMOR EAR OF PATIENTS WITH PRESERVED 














HEARING 
Hearing Preservation. 
Category Successful (n = 47) 
A (PTA < 30 dB, SDS 2 70%) 10 
B (0 dB < PTA< 50 dB, SDS 2 50%) 10 
C (PTA > 50 dB, SDS 2 50%) 4 
D (PTA any level, SDS < 50%) 23 


Classification is according to Committee on Hearing and Equilib- 
rium guidelines. !? 





taken not to pull the tiny filaments of the cochlear 
nerve out of the foraminal tract, and therefore, the 
tumor was always removed from the cochlear nerve 
in a medial-to-lateral direction. Great care was taken 
not to harm the blood supply of the cochlea. 


Neuromonitoring. Starting in 1990, neuromonitor- 
ing was added to the operation. This included test- 
ing of facial nerve conductivity (Neuropack four, Ni- 
hon Kohden, Tokyo, Japan), also brain stem audi- 
tory evoked responses (BAERs) with small earplug- 
type headphones (Sony MD-11, Tokyo, Japan), and 
in a few cases, cochlear nerve action potentials were 
recorded during click stimuli with a silver wire elec- 
trode covered with saline-soaked cotton. In some 
cases, the nerve blood flow was assessed with a la- 
ser-Doppler flowmeter (Periflux, Perimed, Jarfalla, 
Sweden) with a 1-mm probe. 


Statistics. Linear regression analysis was used for 
assessing various factors influencing hearing preser- 
vation. A stepwise logistic regression analysis was 
used when factors determining the outcome of the 
hearing preservation surgery were evaluated. Ken- 
dall’s linear correlation analysis was used to study 
the internal correlations. Student’s f-test was used 
for comparison between the patient groups. 


RESULTS 


Hearing Preservation. Hearing was preserved in 
47 of the 119 patients. In 10 patients with preserved 
hearing, the tumor was larger than 15 mm. Table 310 
presents the postoperative hearing in the afflicted ear. 
Table 4 lists the postoperative hearing of the 27 pa- 
tients with preserved hearing who before operation 
had a PTA of no more than 30 dB and an SDS of at 
least 70%. Of these, 10 patients had better hearing 
than 30 dB. The mean follow-up was 7.3 years (range, 
1 to 16 years). The hearing deteriorated gradually in 
many cases, and 4 patients lost their hearing com- 
pletely (after 3 months in 1 patient, 1 year in 1 pa- 
tient, and 2 years in 2 patients). 


Factors Determining Success Rate of Hearing 
Preservation. Our prediction model for hearing pres- 
ervation, comprising age, tumor size, PTA, SDS, and 
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TABLE 4. PREOPERATIVE AND POSTOPERATIVE PTA LEVELS AND SDS FOR 27 SUBJECTS WITH PRESERVED 























HEARING 
Patient PTA I SDS 1 PTA2 SDS 2 PTA 3 SDS 3 PTA 4 SDS 4 

No. (dB) (%) (dB) (%) (dB) (%) (dB) (%) 

38 30 92 75 50 45 68 40 100 

48 15 96 15 96 20 88 35 84 

56 25 100 30 96 15 89 ` 30 92 

83 20 96 30 92 35 100 40 92 

93 30 100 40 92 40 80 45 84 

96 30 96 70 0 70 20 85 0 
109 30 92 40 92 40 92 45 92 
187 20 94 60 0 60 0 60 0 
306 30 92 35 92 45 75 80 . 0 
321 20 96 30 100 35 96 50 92 
336 30 92 40 80 25 68 60 92 
347 10 88 60 90 35 96 30 92 
350 25 100 60 0 60 0 65 0 
351 25 92 45 80 35 80 40 92 
358 10 76 70 0 30 92 35 72 
380 30 92 75 0 75 16 75 0 
381 5 92 60 0 50 0 70 0 
396 0 96 30 92 25 92 10 100 
399 25 92 55 0 60 100 50 96 
402 10 96 45 92 45 92 50 88 
412 20 88 30 84 40 88 45 92 
451 20 100 40 40 70 20 70 20 
452 25 100 30 96 20 100 25 100 
454 5 92 20 92 15 92 30 100 
457 25 92 25 92 25 92 30 92 
459 10 80 70 0 70 0 70 -0 
460 15 80 15 92 15 92 20 92 


Mean preoperative PTA < 30 dB; mean preoperative SDS 2 70%. 


1 — before operation; 2 — 1 week after operation; 3 — 3 months after operation; 4 — at end of follow-up time (mean, 7.3 y). 





hearing level at 1,000 Hz in pure tone audiometry, 
proved to be valuable (p < .003). In the model, age 
(p < .008) and SDS (p < .06) came out as the stron- 
gest predictors. Tumor size, PTA, and hearing at 1,000 
Hz did not reach significance. Nevertheless, when 
the predictive value of all factors was evaluated with 
a linear discriminant function analysis, the sensitiv- 
ity was 29% and the specificity 80%. Thus, if based 
on age and SDS, the model predicted the outcome of 
surgery correctly in 62% of the cases, the false-nega- 
tive rate being 32%. 


Usefulness of Hearing Preservation. The question- 
naire indicated that 72% of the patients found their 
preserved hearing useful (Table 5). Sound distortion 
was not a problem in 65% of the patients, but was 
distressing in 15%. However, 71% of the patients 
had difficulty in following a conversation during fam- 
ily discussions, 56% had problems with group dis- 
cussions at work, and 26% found it hard to follow 
news on television or radio. 


The subjective usefulness of hearing preservation 


correlated significantly with speech discrimination 
in daily life (r = .85, p < .001), sound localization (r 
=.79, p < .001), and following of family discussions 
(r= .33, p < .01). The postoperative hearing level or 
tinnitus did not correlate with the subjective useful- 
ness of preserved hearing. 


The variability could be predicted in 91% (p < 
.001) of the cases when the predictive factors of the 
benefit of the preserved hearing were studied with 
an analysis of variance. The following factors were 
included in the analysis: sound localization (p < .001), 
PTA (p < .01), and SDS (p < .001). ` 


Tinnitus and Hearing Preservation. Tinnitus was 
present in 68% of the patients. It was localized to the 
afflicted ear in all but 2 patients, who had tinnitus in 
both ears. Table 6 shows the subjective handicap 
caused by tinnitus. Only 1 patient had major prob- 
lems with tinnitus. None of the patients used sleep- 
ing pills regularly. 

We found correlations between the severity of tin- 
nitus and some factors: postoperative hearing level 








` 


Č 


Leyo et al, Hearing Preservation in Vestibular Schwannoma Surgery 395 


TABLE 5. SUBJECTIVE EVALUATION OF USEFULNESS 
OF PRESERVED HEARING 


No. of Patients 
Rating Hearing 
Preservation 
(n = 39) (Mean 








Hearing Speech Sound 
of — Discrim- Locali- 





Contralateral Sound ination zation 
Rating* PTA/SDS) (n = 39) (n=39) (n= 39) 
None 8 (16 dB/97%) 8 15 28 
Undecided 20 (10 dB/100%) 26 23 23 
Some 23 (3 dB/100%) 20 21 15 
Moderate 13 (3 dB/97%) 13 18 21 
Very 36 (11 dB/97%) 33 23 13 


Patient means of contralateral hearing are shown in parentheses. 
Other data are percentages of patients. 


*Usefulness of preserved hearing. 





at 500 Hz (r = .29, p < .05), discrimination of speech 
when listening to television (r = .27, p < .05), quality 
of life (r = .43, p < .001), interference with sleep (r = 
.80, p < .001), concentration ability (r = .47, p < .001), 
and social alienation (r = .64, p < .001). 


DISCUSSION 


In the present study, hearing was preserved in 39% 
of the patients in whom the cochlear nerve was pre- 
served. This result is very similar to that of other 
studies.!516 Four patients (8.5%) with hearing later 
experienced significant hearing impairment. Simi- 
lar impairment has been reported to occur in other 
series with the same frequency.’ There was gradual 
deterioration of hearing also in the other patients, 
but the impairment was minimal and differed little 
from that of the intact ear (mean PTA impairment, 
8.9 dB; SD, 5.5 dB). There was a tendency for those 
with long-term hearing preservation to have smaller 
tumors and a better PTA at 1,000 Hz, as indicated by 
other studies.” 


The preserved hearing was considered beneficial 
by 66% of the patients, although the preserved hear- 
ing was better than 30 dB in only 10 patients. Pre- 
served hearing was perceived as being valuable for 
speech discrimination during conversation. We found 
that neither distortion of speech nor aggravation of 
tinnitus caused major disability after surgery. Select- 
ing patients for attempted hearing preservation, as 
proposed in the literature, on the basis of an SDS of 
better than 30% and a hearing level of better than 70 
dB? was not supported by our study. In the patients 
with preserved hearing, the largest tumor was 35 mm. 
This finding would indicate that a large tumor is not 
necessarily an exclusion criterion for attempted hear- 
ing preservation. However, in the group not consid- 
ered for hearing preservation, the mean tumor size 
was larger, indicating that a larger tumor does re- 
duce the chance of hearing preservation. Although 


TABLE 6. SUBJECTIVE DISABILITY AND USE OF 
SLEEPING PILLS DUE TO TINNITUS ACCORDING TO 
SEVERITY OF TINNITUS FOR PATIENTS WITH 











PRESERVED HEARING 
Rating Subjective Interference Use of 
Severity of Disability With Sleep Sleeping Pills 

Tinnitus (n = 39) (n = 39) (n = 39) 
No tinnitus 38 38 38 
Very mild 5 38 54 
Slight 33 21 8 (occasionally) 
Moderate 21 3 0 (weekly) 
Severe 3 0 0 (every night) 


Data are percentages of patients. 








there were no collected data regarding the interface 
between the tumor and the cochlear nerve, it seemed 
to be an important factor for hearing preservation. 
The cleavage plane was more easily elaborated in 
small tumors, enabling less traumatic tumor removal. 
Furthermore, our impression was that the chances 
for hearing preservation were dismal when the co- 
chlear vessels ran between the tumor and the cochlear 
nerve. When the vessels were located between the 
facial nerve and the cochlear nerve, the chances for 
hearing preservation were more favorable. Unfortu- 
nately, this correlation was made later in the course 
of the hearing preservation program, and therefore, 
these data were not systematically gathered. We did 
not include the BAER in our model, although ac- 
cording to Glasscock et al,!7 it is the variable most 
significantly determining the success rate. Intraop- 
erative BAER monitoring may facilitate learning of 
important steps in hearing preservation surgery. 


There is controversy as to what should be consid- 
ered useful, acceptable, or serviceable hearing. Usu- 
ally, audiometric criteria are advocated, and it is said 
that the 50-50 rule should apply to serviceable hear- 
ing.!7 Our patients’ subjective evaluation of the use- 
fulness of their residual hearing indicates that the lev- 
els are beyond these limits, and there was no corre- 
lation between the measured PTA levels and speech 
discrimination. The second general belief is that if 
the hearing level between the ears differs by more 
than 20 dB, the weaker ear does not aid in conversa- 
tion.!7 In the subjective evaluation of sound locali- 
zation, 49% of our patients were of the opinion that 
the preserved hearing was beneficial, even though 
the difference in hearing level between the ears was 
less than 20 dB in only 10 cases (Table 2). 


It is claimed that leaving the cochlear nerve in- 
creases the risk of handicapping tinnitus.!8 In 5 pa- 
tients with a deaf ear and tinnitus, we removed the 
nonfunctioning cochlear nerve. In 2 of them, the tin- 
nitus was significantly relieved, but in 3, it was un- 
changed. This experience has discouraged us from 
attempts to eliminate tinnitus by reoperation. Among 
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the patients with residual hearing, tinnitus was a ma- 
jor problem in only 1 patient. Thus, the argument 
that leaving the cochlear nerve will worsen tinnitus 
seems unfounded. 


Use of the BAER taught us to handle the nerves 
more gently, but in this study it did not add to the 
results of hearing preservation. Direct cochlear nerve 
recording, promoted by Moller et al,!9 is at present 
the most accurate method. It must be stressed that 
for facial nerves, such monitoring has improved the 
success rate of facial function significantly, and op- 
erating without monitoring the facial nerve almost 
seems malpractice. We have recently started to mea- 
sure the blood flow of the cochlear nerve and the 
inner ear in order to shed light on dangerous mo- 
ments in the surgery. At present, we assume that deaf- 
ness following surgery is primarily due to insuffi- 
cient blood flow in the cochlea rather than in the 


nerve. We hope that our further studies will provide 
a more accurate answer to this question. 


CONCLUSION 


In our study, hearing preservation was successful 
in 39% of the cases in which the cochlear nerve was 
left intact. Of these patients, 66% rated the preserved 
hearing as valuable or very valuable. Therefore, hear- 
ing preservation is useful in VS surgery. Neither tin- 
nitus nor speech distortion was a major problem af- 
ter the surgery. No single factor predicting success- 
ful hearing preservation could be identified. Neuro- 
monitoring did not improve the results. The surgical 
technique is demanding, and hearing loss seems to 
be a result of compression of the vasculature of the 
inner ear rather than of the cochlear nerve. Consider- 
ing these experiences, we strongly advocate that sur- 
geons continue efforts to preserve hearing in VS sur- 


gery. 
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TEMPORAL BONE HISTOPATHOLOGY IN CHARGE ASSOCIATION 
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Three temporal bones obtained at autopsy from 2 patients with CHARGE association (Coloboma, congenital Heart disease, 
Atresia of choanae, mental Retardation and/or central nervous system anomalies, Genital hypoplasia, and Ear anomalies) were 
examined histopathologically. Both temporal bones from 1 patient showed multiple anomalies in the middle ear, inner ear, and facial 
nerve. However, the temporal bone obtained from the other patient showed almost normal structures in the inner ear, middle ear, and 
eustachian tube. These results are different from those of 2 previous reports of temporal bone histopathology regarding CHARGE 
association. This difference suggests that CHARGE association may arise not from one etiopathogenetic factor, but from complex 
factors. Special attention to dehiscent facial canal and perilymphatic gusher during otologic surgery in patients with CHARGE 


association is discussed. 
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INTRODUCTION 


The mnemonic CHARGE association (C — colo- 
boma, H — heart disease, A — atresia of choanae, R 
— retarded development and/or central nervous sys- 
tem anomalies, G — genital hypoplasia, and E — 
ear anomalies and/or deafness) was first proposed 
by Pagon et al! in 1981. In terms of the otologic as- 
pects, more than 80% of patients with this associa- 
tion have anomalies of the external ear, ranging from 
small ears without malformation of the pinna to cup- 
shaped, lop ears, in addition to mild or profound deaf- 
ness. Approximately 50% of these patients also have 
facial nerve palsy.! At this point, even though oto- 
laryngologists have often encountered patients with 
CHARGE association, there have been only a few 
reports of temporal bone histopathologic findings.?3 
We had an opportunity to evaluate histopathological- 
ly 3 temporal bones obtained from 2 patients with 
CHARGE association. Anomalies observed in these 
temporal bones will be reported and discussed with 
regard to clinical implications. 


MATERIALS AND METHODS 
CASE 1 


A 3,580-g, full-term boy was delivered by cesar- 
ean section to a 38-year-old mother. His initial Apgar 
score was 5/8. The infant was noted to have a right 
retinal coloboma, a large ventricular septal defect, 
low-set ears, cleft lip and palate, clinodactyly, and a 
left cervical skin defect. A chromosome study showed 


anormal karyotype. The auditory and visual responses 
were poor. Behavioral audiometry on 2 occasions had 
shown severe to profound loss of voice awareness in 
the sound field. At 5 months of age, the child under- 
went repair of the cleft lip, placement of a palatal 
prosthesis, and bilateral myringotomy with ventila- 
tion tubes. At age 22 months, the patient died of acute 
bronchopneumonia caused by gastroesophageal re- 
flux. An autopsy performed 9 hours after death re- 
vealed growth retardation, asymmetric temporal lobes 
with gyral malformation, genital hypoplasia, and a 
tiny thymus gland with a single parathyroid gland. 
CHARGE association and partial DiGeorge syn- 
drome were the final pathological diagnoses in this 
patient. 


Both temporal bones were removed at autopsy, 
fixed in 10% formaldehyde solution, decalcified in 
trichloroacetic acid, dehydrated in graded concen- 
trations of ethanol, embedded in celloidin, and sec- 
tioned horizontally at a thickness of 30 mm. Every 
10th section was stained with hematoxylin and eosin 
and examined under a light microscope. 


CASE 2 


A 1,920-g girl was born by cesarean section to a 
19-year-old mother after a 34-week gestation. The 
pregnancy was complicated by maternal diabetes 
mellitus, hypertension, and polyhydramnios. The ini- 
tial Apgar score was 3/5. Multiple anomalies were 
observed, including congenital heart anomalies (pa- 
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Fig 1. Case 1, horizontal sections. A) Left ear. Mastoid portion of facial nerve (FN) is dislocated anteriorly to change anatomic 
relation with stapedius muscle (SM) from posterior-anterior relation to anterior-posterior relation. Moderately large bony 
dehiscence of facial canal also is observed (arrow). Hypoplastic endolymphatic sac (ES) without rugose portion is seen in 
vestibular aqueduct that is more anteriorly dislocated than its normal position. High jugular bulb (HJB) is also noted. Bar — 
1 mm, CTN — chorda tympani nerve, RWM — round window membrane, RWN — round window niche. B) Left ear. Organ 
of Corti is severely collapsed. Cochlear nerve fibers are absent, and osseous spiral lamina is notably thin (black arrow). Bony 
defects are observed in modiolus (MD; open arrow). Spiral ganglion cells are totally lost in Rosenthal’s canal (RC). Bar — 0.5 
mm. C) Left ear. Utricular macula (UM) is present and innervated by superior vestibular nerve (SVN). Utricle (U) is relatively 
small. Bar — 1 mm, ME — middle ear cavity, JN — Jacobson’s nerve, GG — geniculate ganglion cells, C — cochlea, IAC — 
internal auditory canal. D) Right ear. Stapes, oval window, and stapedius muscle are completely absent. Geniculate ganglion 
cells (GG) are dislocated in tympanic portion of facial nerve (black arrow). Endolymphatic duct, which is widely communi- 
cated with saccule (S) and utricle (U), is observed (open arrow). Bar — | mm, white arrows — fissula ante fenestram, SVN — 
superior vestibular nerve, SaM — saccular macula, UEV — utriculo-endolymphatic valve, IVN — inferior vestibular nerve. 
E) Right ear. Facial nerve (FN) is bifurcated in labyrinthine portion, and Bill’s bar is poorly developed (arrow). Bar — 1 mm, 
C — cochlea, SVN — superior vestibular nerve, UM — utricular macula. 





tent ductus arteriosus and ventricular septal defect), esophageal fistula with esophageal atresia, a bifid 
choanal atresia, low-set, cup-shaped ears, tracheo- left toe, and vertebral anomalies. At 1 day of age, 


) 
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surgical closure of the tracheoesophageal fistula was 
performed. By 3 weeks of age, the infant underwent 
surgical repair of the ductus arteriosus. Unfortunately, 
she had an episode of cardiac arrest during surgery 
and could not be resuscitated. An autopsy that was 
done 16 hours after death revealed growth retarda- 
tion (height and weight below the fifth percentile) 
microcephaly (266 g; normal, 413 g), microphthal- 
mia, and unicornate uterus with hypoplasia of the 
left fallopian tube. The final pathological diagnosis 
was CHARGE association. 


A specimen including the left temporal bone and 
left eustachian tube was removed at autopsy.* The 
specimen was processed in the same way as those 
for case 1. Finally, the specimen was sectioned ver- 
tically at a thickness of 20 to 30 um. Every 20th sec- 
tion was stained in hematoxylin and eosin and exam- 
ined under a light microscope. 


HISTOPATHOLOGIC FINDINGS 


Only the pathological findings observed in these 
cases are reported; all structures not described were 
observed to be within normal limits. 

CASE 1 


Left Middle Ear. The stapes was hypoplastic. Al- 
though the stapedius muscle was recognized, it was 
located posterior to the mastoid portion of the facial 


Fig 2. Case 2, vertical sections from left ear. Bars — | mm. A) 
Cartilaginous portion of the eustachian tube. Lumen (L) of eu- 
stachian tube is relatively wide. ETC — eustachian tube carti- 
lage, TVPM — tensor veli palatini muscle, LYPM — levator 
veli palatini muscle. B) Cochlea ( C) and middle ear (ME). Co- 
chlea is normally developed in its 2'/2-turn coils. TTM — ten- 
sor tympani muscle, TM — tympanic membrane. C) Vestibule 
(V) and middle ear (ME). Their structures are well developed 
and normal. Facial nerve (FN) is facing middle ear cavity 
through bony defect (arrow) commonly seen in inferior aspect 
of facial canal. I — incus, OW — oval window, SH — stapes 
head, UM — utricular macula. 





nerve. Malposition of the geniculate ganglion cells 
in the tympanic portion of the facial nerve and an 
anteriorly translocated mastoid portion of the facial 
nerve were observed. A huge bony dehiscence of the 
facial canal facing the tympanic cavity in its antero- 
inferior aspect was also observed in the mastoid por- 
tion (Fig 1A). The round window niche, antrum, and 
mastoid air cells were obliterated by mesenchymal 
tissue. 


Left Inner Ear. The cochlea was hypoplastic, and 
the third turn was missing. The organ of Corti was 
severely collapsed. The cochlear nerve and its spiral 
ganglion cells were absent. The osseous spiral lamina 
was notably thin. Multiple bony defects were ob- 
served in the modiolus (Fig 1B). 


In the vestibule, although the utricular endolym- 
phatic fluid space was relatively small, the utricular 
macula was present and innervated by the superior 
division of the vestibular nerve (Fig 1C). The oval 
window was extremely small and hypoplastic. All 3 
semicircular canals and their ampullae were absent. 
A hypoplastic endolymphatic sac with absence of the 
rugose portion was observed in the anteriorly trans- 
located large vestibular aqueduct (Fig 1A). 

Right Middle Ear. The stapes and stapedius muscle 
were absent (Fig 1D). The facial nerve was bifurcated 
in the labyrinthine portion (Fig 1E), and the genicu- 
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late ganglion cells were translocated to the tympanic 
portion (Fig 1D). A huge bony dehiscence of the fa- 
cial canal facing the tympanic cavity anteriorly was 
observed in the mastoid portion. The round window 
niche, antrum, and mastoid air cells were obliterated 
by mesenchymal tissue. 


Right Inner Ear. A reduction in the number of co- 
chlear turns was observed; the third turn of the co- 
chlea was incomplete. 


The findings in the vestibule and semicircular ca- 
nals were similar to those in the left temporal bone 
(Fig 1D,E). The oval window was absent. 

CASE 2 


Middle Ear and Eustachian Tube. A moderate 
amount of cartilaginous tissue remained in the area 
surrounding the fissula ante fenestram. The lumen 
of the eustachian tube in the cartilaginous portion 
was slightly wide (Fig 2A). 

Inner Ear. There were no anomalies or any other 
pathological findings observed in the inner ear (Fig 
2B,C). 


DISCUSSION 


Pagon et al! were the first to define CHARGE as- 
sociation as the pathological condition that includes 
a coloboma or choanal atresia or both, and a total of 
at least 4 of the following abnormalities: coloboma, 
congenital heart disease, choanal atresia, postnatal 
growth retardation, mental retardation, and/or anoma- 
lies of the central nervous system, genital hypopla- 
sia, and anomalies of the ears and/or deafness. The 
cause of CHARGE association is still unknown. Chro- 
mosomal abnormalities have been reported in cases 
of CHARGE association, suggesting that several 
genes may play a role in its pathogenesis.>” Van Me- 
ter and Weaver,’ however, postulated that CHARGE 
association is an example of dysblastogenesis, due 
to abnormal migration of neural crest cells, deficiency 
of mesodermal formation, or abnormal interaction 
between mesoderm and neural crest cells. 


In the report by Dhooge et al, there were 2 cases 
that had anomalies similar to those in case 1 (CHRGE) 
and 2 cases similar to ĉase 2 (HARGE). However, 
there were none among their 4 cases with cleft lip or 
palate. Furthermore, in case 1, characteristics of 
CHARGE ~ssoéiation and DiGeorge syndrome over- 
lapped. Five patients similarto case1 were reported 
by de Lonlay-Debeney. et al: rio However, neither case 
1 nor those 5 patients had a 22q11.2 chromosomal 
microdeletion,!° which has been reported in approxi- 
mately 90% of DiGeorge syndrome patients,!!-!? as 
well as in CHARGE association.® 


Wright et al? reported 4 cases of CHARGE asso- 
ciation, but in only 1 case was a temporal bone evalu- 


ated histopathologically. This case had coloboma, 
congenital heart disease, choanal atresia, central ner- 
vous system, and ear anomalies (CHARE). Abnor- 
malities observed in the temporal bones were 1) mal- 
formation of incus and stapes, 2) absence of oval win- 
dow and stapedius muscle, 3) displacement of facial 
nerve and dehiscence of the facial canal, 4) shortage 
of cochlear duct, 5) reduced population of spiral gan- 
glion cells and Scarpa’s ganglion cells, 6) large patent 
cochlear aqueduct, 7) broad ductus reuniens, 8) small 
saccular maculae, 9) small utricle with near absence 
of utricular maculae, and 10) absence of bony and 
membranous semicircular canals and their ampul- 
lae. Similarly, Guyot et al3 reported the pathological 
findings in 2 temporal bones obtained from a patient 
with CHARGE association who had coloboma, heart 
disease, central nervous system anomalies, and deaf- 
ness (CHRE). The temporal bone abnormalities were 


1) absence of oval windows, 2) cochlear dysplasia, 


3) absence of spiral ganglion cells, 4) absence of 
semicircular canals and utricles, 5) reduced number 
of vestibular neurons, and 6) wide and short endolym- 
phatic sac. They termed this specific form of laby- 
rinthine dysplasia “CHARGE dysplasia,” which in- 
cluded complete absence of the pars superior (bony 
and membranous semicircular canals and utricle) and 
a Mondini dysplasia of the cochlea with preserva- 
tion of the saccule and its innervation.!3 In other 
words, the anomalies in the pars superior were more 
severe than those in the pars inferior. 


The anomalies in the pars superior in our case 1 
(CHRGE) were also more severe than those in the 
pars inferior. While the small utricle and its macula 
were similar to those reported by Wright et al,” the 
anomalies in the pars inferior were similar to those 
in the case of Guyot.et al.3 Interestingly, in case 2 
(HARGE), no anomalies of the middle and inner ears 
were found. These observations indicate difficulties 
in defining the characteristics of temporal bone ab- 
normalities in CHARGE association unitarily. The 
observation that this association does not arise from 
one etiopathogenetic cause is supported by these dif- 
ficulties. 


Anomalies in the middle and inner ear have been 
observed in the subtypes CHARE,? CHRE,? and 
CHRGE (case 1) of CHARGE association, but the 
“HARGE” subtype (case 2) revealed no such anoma-’ 
lies. Since very few temporal bones from patients with 
CHARGE association have been evaluated histopath- 
ologically, it is difficult to define characteristic fea- 
tures of temporal bone abnormality in CHARGE as- 
sociation. However, it might be worth mentioning that 
patients with the 3 subtypes CHARE, CHRE, and 
CHRGE (case 1), which include coloboma (abbre- 
viated as C) as a part of their anomalies, all had se- 
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vere anomalies in the middle and inner ears, while 
our patient with subtype “HARGE” (case 2), with- 
out coloboma, had normal middle and inner ears. 
Also, it should be mentioned that in cases of the sub- 
type “HARGE” (case 2), even if the middle and in- 
ner ears are normal, the external ear may have anoma- 
lies. Hearing loss may be caused in some of those 
cases by the presence of severe external ear abnor- 
malities. 


Patients with CHARGE association frequently 
have facial nerve palsy,' an abnormal course of the 
facial nerve, and a large bony dehiscence of the fa- 
cial canal. Therefore, surgeons have to be aware of 
the possibility of causing trauma to the facial nerve 
during their surgical approach into the middle ear 





through the external meatus or antrum. Also, if pa- 
tients with CHARGE association undergo cochlear 
implant surgery, there is a risk of a perilymphatic 
gusher!4 due to the malformation of the modiolus. 
Preoperative high-resolution computed tomography 
and magnetic resonance imaging should be helpful 
in predicting and preventing iatrogenic injury dur- 
ing surgery. It is important for otolaryngologists to 
be knowledgeable about the temporal bone abnormal- 
ities in CHARGE association, since they often en- 
counter such patients in the clinical setting. 


In conclusion, further detailed analysis and classi- 
fication of temporal bones in patients with CHARGE 
association will provide knowledge as to the etiol- 
ogy and pathology of this disorder. 
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EARLY HISTOPATHOLOGIC CHANGES AFTER LATERAL 
SEMICIRCULAR CANAL TRANSECTION AND OCCLUSION 
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Occlusion of the semicircular canals has been used in otoneurologic and skull base surgeries with preservation of hearing. In 
order to understand the role of occlusion in hearing preservation, we observed early histopathologic changes of the lateral semicircu- 
lar canal following transection and occlusion in guinea pigs. After surgery, the membranous endolymphatic canal was collapsed, and 
its torn ends were sealed by intramural bone dust in the bony semicircular canal. On the first postoperative day, the endolymphatic 
canal was expanded to its normal shape, and the sealing at the torn ends was maintained by the bone dust. These findings suggest that 
occlusion after semicircular canal injuries, by making blind ducts in the membranous endolymphatic canal, is important for the 


preservation of postoperative hearing. 


KEY WORDS — hearing preservation, injuries and occlusion of semicircular canal, temporal bone histopathology. 


INTRODUCTION 


Hearing preservation after a membranous labyrin- 
thine injury has been of great interest to all otologists 
and neurotologists for decades. There has been much 
clinical evidence that the semicircular canals can be 
manipulated without hearing loss.}-!6 Many animal 
studies have supported the possibility of partial laby- 
rinthectomy.!7-28 However, the mechanism of hear- 
ing loss and hearing preservation in membranous 
labyrinthine injuries is not yet clear. 


Our previous study demonstrated that the membra- 
nous endolymphatic canal healed with blind ducts 
after plugging of a transected semicircular canal.?5 
The purpose of this study was to investigate early 
histopathologic changes of the lateral semicircular 
canal shortly after transection and occlusion and to 
determine the role of occlusion in hearing preserva- 
tion. 


MATERIALS AND METHODS 


Animals. Four Hartley guinea pigs weighing from 
250 to 300 g with normal auditory brain stem re- 
sponses (ABRs) were used. The animals were raised 
and maintained in a conventional facility at Chung- 
nam National University Hospital. 


Auditory Evaluation. Sound stimuli (broadband 
click, rarefaction phase, 11.29 Hz) were generated 
and delivered by an audiometer equipped with an 
insert earphone (Quantum 84, Cadwell Laboratories 


Inc, Kennewick; Washington). The rectal tempera- 
ture and oxygen saturation were monitored through- 
out the procedure. Thresholds were determined by 
attenuating the stimulus by 20-dB increments from 
70 dB sound pressure level until the waveform was 
lost and then raising it by 5-dB increments until the 
waveform appeared. The ABR thresholds were mea- 
sured before surgery and/or sacrifice. 


Surgical Technique. Under general anesthesia by 
means of intraperitoneal injection of ketamine hydro- 
chloride (50 mg/kg), the right side of the bulla was 
exposed laterally. The prominent portion of the lat- 
eral semicircular canal, posterior to the facial canal, 
was transected with a surgical drill. Free leakage of 
the inner ear fluid was observed from the two open- 
ings of the ampullated and nonampullated sides of 
the transected semicircular canal. The canal lumens 
were occluded with bone dust by low-speed drilling 
with a diamond bur until there was no fluid leakage. 


Histopathology. Two animals were painlessly sac- 
rificed after the surgery, and the other 2 on the first 
postoperative day. The animals were deeply anesthe- 
tized and perfused intravitally with normal saline so- 
lution and Heidenhain-Susa fixative. The temporal 
bones were removed, fixed, and processed as de- 
scribed by Schuknecht.2? The specimens were em- 
bedded in celloidin and sectioned horizontally at 20- 
um thickness. Every 10th section was stained with 
hematoxylin and eosin and examined under a light 
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Fig 1. Schematic drawing of changes in lateral semicir- 
cular canal following transection and occlusion. Ampul- 
lated side of lateral semicircular canal. A) Occlusion seals 
torn end of membranous endolymphatic canal. B) Re- 
covery on first postoperative day. Cross-sectional pho- 
tomicrographs at a, b, c, and d are shown in Fig 2A, 2B, | 
2C, and 2D, respectively. : 


microscope. 


RESULTS 


On histopathologic examination of the temporal 
bones, 2 animals in each group showed similar histo- 
pathologic features. On the day of surgery, the mem- 
branous endolymphatic canal was collapsed and the 
perilymphatic trabecular meshwork was detached 
from the bony canal wall (Figs 1A and 2A). The torn 
ends of the endolymphatic canal were sealed by bone 
dust that was introduced into the bony canal lumens 
by low-speed drilling (Figs 1A and 2B). On the first 
postoperative day, the collapsed endolymphatic ca- 
nal was expanded to its normal shape, with reestab- 
lishment of the perilymphatic trabecular meshwork 
(Figs 1B and 2C). The membranous endolymphatic 
canal was still sealed by the intramural bone dust 
(Figs 1B and 2D), maintaining 2 blind ducts at the 
ampullated and nonampullated sides of the lateral 
semicircular canal. The cochlea and vestibular recep- 
tors appeared normal. 


After the operation, all animals exhibited sponta- 
neous nystagmus and an operated side—down head- 
tilt position. The ABR thresholds were changed less 
than 10 dB from the preoperative levels. These find- 
ings were similar to our previous observations.”3 


DISCUSSION 
Surgical manipulations on the semicircular canals 


have been attempted in humans*+7.9-!6 and experi- 
mental animals!7:19-23,25-28 with various techniques. 
Most of these techniques were concerned with mini- 
mizing the risk of membranous labyrinthine injury 
by using less traumatic means, such as nonmechan- 
ical occlusion using a laser,?+!3-15.20.21.25 mechanical 
occlusion after a careful fenestration,41!5:16.25 and 
transection after a secure occlusion.>:!9.11.14.19 How- 
ever, even after a membranous endolymphatic canal 
rupture, hearing could be preserved successfully by 
occluding the injured semicircular canal.9+17;22,23.26,27 
Therefore, our experiment was aimed at determin- 
ing the effect of occlusion in hearing preservation 
following a membranous labyrinthine injury. Under- 
standing the occlusion effect may be helpful for the 
further extension of partial labyrinthectomy. 


Hearing loss in membranous labyrinthine injuries 
may be attributable to a mechanical or chemical ef- 
fect on the cochlea.39 In the early stage of membra- 
nous labyrinthine rupture, the cochlea may be in- 
sulted by the mechanical effect, including capillary 
rupture, hemorrhage, and direct cellular damage, or 
by the chemical effect of a massive perilymph-endo- 
lymph contamination.3 Once the endolymphatic space 
is violated, the early-type injuries to the cochlea seem 
to be inevitable, even though we try to minimize them. 
However, the early-type injuries that occur during 
manipulation of the semicircular canals seem to be 
minimal or transient. Kobayashi et al!® found that 
complete interruption of the lateral semicircular ca- 
nal did not affect the endocochlear potential or com- 
pound action potential in guinea pigs, or the com- 
pound action potential in humans. Smouha et al2?:26.27 
performed transection of the lateral semicircular ca- 
nal with intentional suctioning of the inner ear fluid 
in guinea pigs. Suctioning of the inner ear fluid led 
to a transient hearing loss with eventual recovery. 


In the late stage, the cochlear and vestibular recep- 
tors may continue to suffer from the chemical effect 
of fluid intermixing through a communication be- 
tween the two chemically different spaces (endolym- 
phatic and perilymphatic), if the fistula is not closed.”3 
The persistent ionic contamination must result in 
anergy of the cochlear receptors. Smouha et al2?,26.27 
also suggested that electromechanical factors, rather 
than hair cell death, were responsible for hearing loss 
after partial labyrinthectomy. We believe that the ul- 
timate hearing loss following a membranous labyrin- 
thine rupture is attributable to this late-type injury.”3 


For an explanation of hearing preservation in mem- 
branous labyrinthine injuries, Jahrsdoerfer et al! ad- 
dressed the valvular function of the utriculoendolym- 
phatic valve, which could prevent the escape of en- 
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Fig 2. Corresponding photomicrographs of Fig 1 at a, b, c, and d are A, B, C, and D (H & E. original x100). A) Partially 


collapsed endolymphatic canal and detached perilymphatic trabeculae. B) Endolymphatic canal (arrowheads) that is totally 
collapsed and sealed by intramural bone dust (s). C) Completely expanded endolymphatic canal and reestablished perilym- 
phatic trabecular meshwork. Bony spicule (arrow) is seen in perilymphatic space. D) Maintenance of endolymphatic sealing 


(arrowheads) by intramural bone dust (s) at occluded portion. 


dolymph from the saccule and cochlea in a sudden 
decompression of the vestibular endolymphatic sys- 
tem. However, our previous temporal bone study 
demonstrated that the valve was not functioning when 
the inner ear fluid was suctioned from a transected 
semicircular canal in guinea pigs.°? Canalis et al3 
thought that the sudden loss of endolymph and sup- 
porting structure collapsed the membranous endo- 
lymphatic canal and sealed its torn ends. Tatagiba et 
alë proposed an incidental closure of the opened ca- 
nal by bone fragments preventing endolymph leak- 
age, and a homeostatic function of the endolymphatic 
sac in recovery. The findings in the current study sup- 
port their concepts of membranous canal collapse? 
and closure by bone fragments.® We believe that the 
occlusion of an injured semicircular canal, by pre- 
venting fistulization between the endolymph and peri- 
lymph (late-type injury), is important in preserving 
postoperative hearing.?3 


We used bone dust in this study and muscle in our 
previous study as an occlusion material.?3 Other au- 


thors used bone wax,?:!0.!1,14.22.25-27 bone dust,!5:!7 


fibrin glue,'? a mixture of bone dust and fibrin glue,‘ 
a mixture of fascia, bone dust, and fibrin glue,!6 fas- 
cia,? or periosteum.® No matter which occlusion ma- 
terial was used, our study demonstrated that the in- 
tramural material contributed mechanically to the for- 
mation and maintenance of blind ducts in the mem- 
branous endolymphatic canal. These findings sug- 
gest that a simple covering or seal may not be ade- 
quate to prevent intralabyrinthine communication in 
membranous labyrinthine injuries. If there is any sus- 
picion of membranous labyrinthine injury during sur- 
gery, we prefer canal occlusion to the conventional 
sealing technique in treatment of semicircular canal 
fistula. Chemical or tissue reactions of the occlusion 
materials in the labyrinth are unknown, and further 
experiments are required. 


The semicircular canals are unique portions of the 
labyrinth that have a narrow bony canal lumen that 
is easy to occlude, and their location far from the 
cochlea enables us to treat them with relative safety. 
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In the vestibule, there seems to be no effective means 
to prevent intralabyrinthine communication follow- 
ing a saccular or utricular rupture. Packing the ves- 
tibular lumen may result in obliteration of the vestib- 
ular perilymphatic space that provides the entrance 
of sound energy into the cochlea.” However, the ves- 
tibule may not be a limitation in partial labyrinthec- 
tomy. McElveen et al>-!! were able to preserve post- 
operative hearing after modified translabyrinthine re- 
moval of acoustic neuromas. Their success in hear- 
ing preservation might be attributable to their avoid- 
ing saccular or utricular rupture and maintaining the 
vestibular perilymphatic space. The extent of partial 
labyrinthectomy is not defined clearly, and further 
experiments are needed. 


Animals may show responses to labyrinthine in- 
juries that are different from those of human beings. 





Surprisingly, we could not cause anacusis in chinchil- 
las, even when the transected semicircular canal was 
left unoccluded. The animals usually died of debili- 
tation from persistent vestibular symptoms (unpub- 
lished observations). Other authors also observed 
similar findings in rats?4 and guinea pigs.28 These 
findings reflect interspecies differences, and careful 
interpretation may be needed in animal models. 


In summary, occlusion following transection of the 
lateral semicircular canal in guinea pigs formed blind 
ducts in the membranous endolymphatic canal. The 
immediate interruption of the communication be- 
tween the endolymph and perilymph seems to be im- 
portant in preserving postoperative hearing. For the 
development of safer and more reliable techniques 
of partial labyrinthectomy, further experimental stud- 
ies are required. 
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The main aim of this study was to validate a new technique, neural response telemetry (NRT), for measuring the electrically 
evoked compound action potential in adult cochlear implant users via their Nucleus CI24M implant. Thirty-eight adults were evaluated 
with a variety of measurement procedures with the NRT software. Electrically evoked compound action potentials were obtained in 
31 of the 38 adults (81.6%) and in 132 of the 160 electrodes (82.5%) tested. In addition to validating this technique, we also established 


a set of default clinical test parameters. 


KEY WORDS — cochlear implant, electrically evoked compound action potential, neural response telemetry. 


INTRODUCTION 


Impedance and compliance telemetry is useful in 
assessing electrode function of most of the current 
cochlear implant systems. However, because of the 
variation in spiral ganglion cell survival associated 
with deafness, variation in performance across co- 
chlear implant users remains unpredictable despite a 
fully functioning electrode array. Some clinical mea- 
sure of peripheral neural function in addition to tele- 
metric assessment of electrode function is a reason- 
able approach to addressing this issue. Such a mea- 
surement technique would need to be validated before 
considering its clinical implications. 


In 1992, a bidirectional telemetry feature was im- 
plemented in the design of the Nucleus CI24M co- 
chlear implant system? that can be used to record 


Fig 1. Neural response telemetry (NRT) system 
comprising personal computer (PC) with in- 
stalled NRT software and IF5 interface card, 
clinical processorcontrol interface, SPrint speech 
processor, headset with coil for radio frequency 
(RF) transmission, and Nucleus CI24M implant 
including electrode array. Stimulation param- 
eters transmitted via RF link from processor to 
implant are received and decoded (Rx Decode) 
and presented as stimulus (Stim) by implant. 
Evoked compound action potential is then re- 
corded (Rec), and data are encoded for transmis- 
sion (Tx Encode) back to implant over same RF 
link. 







Host PC with NRT software 
and IF5 card 


electrically evoked compound action potentials 
(ECAPs). The action potentials that result from a 
stimulus applied on a given intracochlear electrode 
are recorded from a neighboring electrode, amplified, 
and then encoded for transmission via the radio fre- 
quency (RF) link back to the speech processor. Cus- 
tom (neural response telemetry (NRT) software, writ- 
ten at the Department of Otorhinolaryngology, Uni- 
versity Hospital of Ziirich, communicates with the 
speech processor to capture, process, store, and dis-. 
play the measurement data on a host personal com- 
puter, as shown in Fig 1. The NRT software controls 
the parameters of the stimulus used to evoke the re- 
sponse being measured, as well as the parameters 
used to perform the recording. 


The NRT software implements the masker-probe 
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A: Probe only 


Measurement A consists 
of Probe Artifact and 
Neural Response 


Measurement B consisis 
of Masker Artifact and 
Probe Artifact but no 
Neural Response as a 
result of Forward-Masking 


Measurement C consists 
of Masker Artifact only 
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paradigm described by Brown et al? that is illustrated 
in Fig 2 and involves a sequence of 1) probe-only 
(A), 2) masker-followed-by-probe (B), and 3) mask- 
er-only (C) stimuli. The probe-only condition (A) 
yields the desired neural response plus an artifact 
from the probe. The masker-and-probe condition (B) 
with an appropriate masker advance yields stimulus 
artifacts from both masker and probe; the neural re- 
sponses to the probe are absent or diminished be- 
cause of forward masking. The masker-only condi- 
tion (C) yields only the masker artifact. By using the 
subtraction method (A — [B — C]) to cancel the large 
masker stimulus artifacts found in each condition, 
one can extract the relatively small neural response. 


The main aim of this study was to validate the 
NRT measurements and the software with adult users 
of the Nucleus 24 cochlear implant and to determine 
the major test parameters necessary to maximize the 
chances for recording an ECAP. The NRT stimulation 
parameters on 1 electrode pair are illustrated in Fig 

Stimutus Start of 
Onset Measurement 


i Time from l 
: Stimulus Onset “} 







Masker 





Total of 32 Samples in High Resolution Mode 


Subtraction Paradigm Fig 2. Subtraction method for re- 
A-(B-C) ducing masker stimulus artifact. 
ree Stimulation signals from buffers 
A, B, and C and their respective 
responses are shown on left, and 
subtraction paradigm (A — [B — 
C]) with resultant enhanced neu- 
ral response is shown on right. 
First negative peak is labeled N1, 
and following positive peak is la- 
beled P1. Ampitude of neural re- 
sponse is defined as N1-P1 peak- 
to-peak amplitude. 
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3. The specific test (stimulation and recording) pa- 
rameters investigated in this study included 1) stim- 
ulation mode (wide bipolar or monopolar); 2) mea- 
surement delay (time period between the end of the 
probe stimulus and the start of the measurement: 35 
to 150 us); 3) stimulation rate (number of stimulus 
pulses per second: 80 or 35 Hz); 4) number of sweeps 
(number of averaged responses: 50, 100, or 200); 5) 
masker level (stimulation level of the masker stimu- 
lus); and 6) masker advance (time interval between 
the end of the masker stimulus and the onset of the 
probe stimulus). 


METHOD 


A total of 38 users of the Nucleus C]24M cochlear 
implant (21 men and 17 women; age range, 24 to 75 
years; mean age, 50.5 years) from 5 European co- 
chlear implant centers participated in the study. Each 
subject had to have a minimum of 18 intracochlear 
electrodes and be willing to participate in the trial. 
No subjects were excluded from this study on the 
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Fig 3. Diagram of NRT parameters shows masker amplitude, masker pulse width, masker advance, probe amplitude, probe 
pulse width, measurement delay, and sampling period. In standard resolution, 16 samples at 10 kHz spanning 1.5 ms are 
captured. In high-resolution mode, 2 sweeps of 16 samples, each offset by 50 us, are recorded. and interlaced to form single 
data set. Time axis origin is at probe stimulus onset. : 
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TABLE 1. SUMMARY OF SUBJECT DATA AND MEASUREMENT RESULTS 
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Neural Response 
Age (y) Cause of Hearing Electrode Electrode Electrode Electrode Electrode 

Subject* Sex at Testing Impairment 20 15 10 5 I 
Fl F 29 Ototoxicity Yes DNT Yes DNT Yes 
F2 M 43 Progressive loss No DNT No DNT No 
F3 F 44 Progressive loss No DNT No DNT No 
F4 F 49 Progressive loss Yes DNT Yes DNT Yes 
F5 F 49 Progressive loss No DNT No DNT No 
F6 M 61 Sudden loss Yes DNT Yes DNT Yes 
F7 M 62 Progressive loss Yes DNT Yes DNT Yes 
F8 M 69 Sudden loss No DNT Yes DNT Yes 
H1 F 67 Progressive loss DNT DNT Yes DNT Yes 
H2 M 42 Meningitis Yes Yes Yes Yes Yes 
H3 M 64 Progressive loss Yes Yes Yes Yes Yes 
H4 F 24 Progressive loss No Yes Yes Yes Yes 
H5 M 45 Sudden loss DNT DNT No DNT No 
H6 M 63 Scarlet fever No DNT No No No 
H7 M 64 Progressive loss Yes DNT Yes DNT Yes 
Kl M 25 Unknown Yes Yes Yes Yes Yes 
K2 M 40 Unknown Yes Yes Yes Yes Yes 

K3 F 41 Meningitis DNT DNT Yes DNT DNT 
K4 M 55 Unknown Yes Yes Yes Yes Yes 
K5 F 56 Unknown Yes Yes Yes Yes Yes 
K6 M 64 Unknown Yes Yes Yes Yes Yes 
K7 M 58 Unknown Yes Yes Yes Yes Yes 
K8 M 75 Unknown Yes Yes Yes Yes Yes 
Li F 43 Progressive loss Yes Yes Yes Yes Yes 
L2 M 43 Progressive loss No Yes Yes Yes Yes 
L3 M 57 Progressive loss No No No No No 
L4 M 61 Progressive loss Yes Yes Yes Yes Yes 
z F 42 Progressive loss Yes Yes Yes Yes Yes 
Z2 F 51 Progressive loss Yes Yes Yes Yes Yes 
Z3 F 28 Progressive loss Yes Yes Yes Yes No 
ZA F 45 : Sudden loss Yes Yes Yes Yes Yes 
Z5 F 62 Progressive loss Yes Yes Yes Yes Yes 
Z6 M 72 Progressive loss No Yes Yes Yes Yes 
z7 F 58 Progressive loss Yes Yes Yes Yes Yes 
Z8 M 52 Progressive loss Yes Yes Yes Yes Yes 

Z9 M 37 Progressive loss Yes Yes Yes No DNT 
Z10 F 37 Progressive loss Yes Yes Yes Yes Yes 
Zil F 43 Progressive loss Yes Yes Yes No No 


DNT — did not test. 


*Subjects are identified by letter identifying test center (F — Freiburg, H — Hannover, K — Kiel, L — Lund, Z — Zürich) and sequential 


number for each center. 














basis of any other criterion apart from age or consent cochlear implant system as shown in Fig 1. The im- 
to participate. The details (gender, age, and cause of plant has 22 intracochlear electrodes, 1 extracochlear 
hearing impairment) of the subjects included in this ball electrode on a flying lead placed under the tem- 


study are summarized in Table 1. 


poralis muscle (MP1), and a plate electrode on the 
receiver-stimulator (MP2). Measurements were made 


The NRT measurements in all of the test centers for each subject at different stimulation-recording 
were completed after operation with the same NRT sites along the electrode array, as summarized in Ta- 
v1.04 software and similar test setups consisting of ble 2. Wide bipolar, as well as monopolar, stimulation 


a personal computer, IF5 interface card, processor modes were initially investigated (Brown et al‘) with 
control interface, SPrint speech processor, and CI24M a subset of the subjects. For the recording, the mono- 
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TABLE 2. ELECTRODE CONFIGURATIONS TESTED 








Stimulating Indifferent Recording Indifferent 
Electrode Electrode Electrode Electrode 
10 20 15 MP2 
20 10 15 MP2 
1* 11 6 MP2 
1* MP1 3§ MP2 
5 MP1 7 MP2 
10 MP1 12 MP2 
15 MP1 17 MP2 
20 MP1 22 MP2 


*Or most basal possible. 
{Stimulating electrode + 10. 
{Stimulating electrode + 5. 
§Stimulating electrode + 2. 





polar mode was chosen because it had been shown 
to give the largest response amplitudes in a prelim- 
inary pilot study. For bipolar stimulation modes, the 
active recording electrode was selected to be in the 
middle between the 2 bipolar stimulation electrodes. 
For the monopolar stimulation mode, the active re- 
cording electrode was selected to be 2 electrodes 
more apical of the active stimulation electrode. Indi- 
vidual centers investigated the effects of different test 
parameters described, above (stimulation rate, num- 
ber of sweeps, measurement delay, masker level, and 
masker advance). 


RESULTS 


In general, all of the centers were able to obtain a 
reliable response with NRT across a variety of co- 
chlear implant users (Table 1) under different mea- 
surement conditions. Neural responses could be ob- 
tained from 31 of the 38 subjects (81.6%) and on 
132 of the total of 160 electrodes tested (82.5%). 


Figure 4 shows a typical amplitude growth series 
indicating how the neural response changes with the 
stimulation level. The first negative peak (N1) of the 
neural response occurs approximately between 200 
and 300 us after the stimulus onset, followed by the 
first positive peak (P1), occurring at approximately 
700 us after the stimulus onset. The response am- 
plitude is defined as the peak-to-peak amplitude from 
N1 to P1. Plotting the change in response amplitude 
as a function of the stimulation level yields the ampli- 
tude growth function (AGF). _. 


Data comparing wide-bipolar versus monopolar 
stimulation were initially collected from 15 subjects. 
Neural responses were obtained from 13 of these sub- 
jects. Comparison of the AGFs from bipolar and 
monopolar mode stimulation showed that in the ma- 
jority of cases (10 of the 13 subjects; 76.9%), mono- 
polar stimulation produced larger neural responses 
than equivalent wide-bipolar stimulation, as illu- 
- strated by the example in Fig 5. All subsequent results 


Probe Current Level 


a : 








Time in ms | 


Fig 4. Typical series of neural responses recorded with 
NRT v1.04 software. In this example, monopolar stimu- 
lation was presented on electrode 1 (indifferent electrode 
MP1) and recorded from electrode 3 (indifferent elec- 
trode MP2). Probe current level varied from 230 to 200 
CL (current levels; internal units of Nucleus CI24M co- 
chlear implant), while masker current level was fixed at 
230 CL. Note that time axis origin corresponds to time 
of probe stimulus onset. 


were then based on the findings made in monopolar 
stimulation mode according to the electrode config- 
urations shown in Table 2. 


The effect of varying the measurement delay was 
examined with 34 subjects. Six of these subjects 
yielded no neural responses. From the other 28 sub- 
jects, the results showed that shorter delays were 
more likely to result in overloading of the measure- 
ment amplifier, producing a distorted waveform (Fig 
6; 46 us). Increasing this delay reduces the likelihood 
of saturating the measurement amplifier, but exces- 
sive delays could result in missing early features of 
the waveform, such as the N1 peak (Fig 6; 84 us). 
Careful adjustment of the delay parameter is required 
to produce a clear neural response (Fig 6; 70 us). 
The optimal delay depends on the subject and the 
measurement amplifier gain selected, and it ranged 
between 35 us and 100 us for these 28 subjects. 


Another way to avoid nonlinear operation of the 
measurement amplifier is to reduce the gain from 60 
dB to 40 dB. The recordings for 10 subjects that 
showed amplifier saturation effects at 60-dB mea- 
surement amplifier gain were improved when the 
gain was lowered to 40 dB, thereby yielding clear 
neural responses even with a short delay. 


A comparison of 2 stimulation rates (35 Hz and 


80 Hz) was carried out in 31 subjects, and an example 
is shown in Fig 7. No significant difference in the 
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Fig 5. Example of amplitude growth functions comparing 
neural responses from bipolar versus monopolar stimula- 
tion. In this instance, stimulation active electrode was 
10, and indifferent electrode was either 20 (bipolar) or 
MP1 (monopolar; see Table 2). In 10 of 13 subjects, neu- 
ral responses to monopolar stimulation were larger than 
bipolar counterpart at same stimulation level. 


resulting neural response was found between the 2 
stimulation rates in all 31 cases. Also, for a given 
stimulation level, the perceived loudness was found 
to be softer, to various degrees, at the lower stimu- 
lation rate in all cases. 


Excellent recordings of the ECAP have been ob- 
tained by averaging over only a small number of 
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Fig 6. Effect of variations of measurement delay on same 
electrode with all other test parameters kept constant. 
Jagged waveform at 46 us indicates nonlinear saturation 
behavior of measurement amplifier. Longer delays restore 
linear behavior of amplifier. Increasing delay indefinitely 
would eventually result in not capturing N1 peak. In this 
example, measurement delay of 70 us produces best neu- 
ral response. 
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Fig 7. Comparison of 2 stimulation rates, 35 Hz and 80 
Hz, with all other test parameters kept constant. Choice 
of stimulation rate has no effect on recorded neural re- 
sponse. i 


sweeps (Fig 8). Data from 19 subjects investigated 
all showed that at 60 dB of gain, as the number of 
sweeps was increased, the relative noise in the record- 
ing improved. Beyond 100 sweeps, there is little sub- 
jective improvement in the quality of the response 
waveform. At 40-dB gain, 200 sweeps were neces- 
sary to obtain a clear neural response. 


The effect of the masker level was also investigated 
in 13 subjects, and an example typical of all 13 sub- 
jects is shown in Fig 9. The neural responses on the 
left in Fig 9 were measured with a fixed masker at 


No. Sweeps 








o 05 4 15 2 25 
Time in ms 


Fig 8. Effect of variations of number of sweeps across 
which measurements are averaged from 10 to 250 sweeps, 
with all other test parameters kept constant and measure- 
ment amplifier gain set to 60 dB. Increasing number of 
sweeps improves signal-to-noise ratio. Beyond 100 
sweeps, there is little improvement in quality of recorded 
waveform. 
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Fig 9. Comparison between NRT measurements recorded 
with fixed high masker level (left) and responses with 
masker level that was varied with probe stimulus level 
(right). Note that on left, clearly defined N1 is seen at 
probe level of 190 CL. On right, similar N1 response is 
seen at probe level of 195 CL. 


the highest acceptable presentation level, and the re- 
sponses on the right were recorded with a masker set 
to the same level as the probe stimulus level. The 
responses obtained with the fixed high masker level 
(set at the loudest acceptable presentation level for 
the NRT stimulus) show a lower neural response 
threshold than do the responses obtained with the 
masker level set to be the same as the probe level. 
The paradigm in which the masker and probe varied 
together was preferred by the subjects, because they 
were not always presented with a continuously loud 
signal during the measurement, as the stimulation 
level was varied. Although it was not the preferred 
condition, all subjects found the fixed high masker 
level acceptable. 


The effect of the masker advance was examined 
with 22 subjects. Figure 10 illustrates how the re- 
sponse amplitude typically varies as the masker ad- 
vance is increased from 125 us to 2,500 us. The NRT 
response amplitude increases with increasing masker 
advance up to approximately 500 us, at which the 
largest response amplitude was recorded. The ampli- 
tude then decreases markedly with longer masker 
advance intervals. Note that the chart is plotted on a 
log-log scale. 


Only 7 of the 38 subjects tested demonstrated no 
neural response on any of the selected test electrodes 
with this technique. 


DISCUSSION 


The ability to record neural responses in 82.5% of 
the tested eleçtrodes (81.6% of the subjects) supports 
the validity of the NRT system for measuring ECAPs 
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Fig 10. Effect of masker advance on neural response 
amplitude for intervals from 150 to 4,000 us. Largest 
response amplitude occurs around masker advance of 500 


Us. 


in adult subjects. Variability in response waveforms 
and response amplitudes across subjects and different 
stimulation and recording sites, which may be depen- 
dent on physiological factors such as cause of hear- 
ing impairment, local nerve survival,>-’ or location 
of stimulation and recording electrodes within the 
scala tympani, can only be properly evaluated after 
test parameters have been appropriately optimized. 


The measurement delay, the gain of the measure- 
ment amplifier, the masker level, and the masker ad- 
vance were shown to have an effect on the ability to 
obtain a neural response in an individual cochlear 
implant user, and therefore optimization of the test 
parameters was important to obtaining a repeatable 
response. 


Varying the measurement delay parameter essen- 
tially changes the time window used to capture the 
neural response. Nonlinear behavior of the measure- 
ment amplifier would introduce distortions to the 
neural response. The measurement delay is needed 
in order to prevent the very large stimulus artifacts 
accompanying the neural response from driving the 
amplifier into nonlinear operation. The measurement 
delay, however, should be kept as short as possible 
in order to capture the N1 peak. The absence of the 
N1 peak would compromise estimates of the response 
amplitude. Thus, there is a tradeoff between being 
able to capture the neural response as early as possible 
and avoiding nonlinear operation of the amplifier. 
The delay parameter therefore needs to be carefully 
adjusted to obtain a clear neural response. 


Related to the issue of avoiding nonlinear behavior 
of the measurement amplifier is the gain setting of 
the amplifier. A smaller gain is less likely to result in 
saturation of the amplifier. However, for a given 
amount of internal measurement noise, the signal- 
to-noise ratio at 40-dB gain is obviously poorer than 
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at 60 dB, and more averaging, and consequently a 
longer measurement time, is then required to get rid 
of the accompanying noise. As a shorter measurement 
time is generally more desirable, a default gain of 60 
dB should be used whenever possible. A 40-dB gain 
should be selected if optimization of the measurement 
delay at 60 dB of gain fails to produce a clear neural 
response. - 


The use of a fixed high masker level produces a 
more accurate record of the neural response, primar- 
ily because of the forward-masking nature of the sub- 
traction paradigm. Ideally, the subtraction paradigm 
requires that the stimulated neurons be fully refrac- 
tory from the masker so that there is no neural re- 
sponse to the subsequent probe signal in the masker- 
and-probe condition (Fig 2, measurement B). If there 
is insufficient forward masking, the remnant neural 
response evoked by the probe signal will result in a 
reduction of the resultant response amplitude after 
the subtraction. This effect is seen in Fig 9, in which 
the response amplitude falls off more rapidly when 
the masker is not kept at a fixed high level. 


The masker advance is also a crucial parameter in 
ensuring that the subtraction paradigm works as it is 


meant to. The optimal amount of forward masking: 


appears to occur at around 500 us of masker advance. 
With longer masker advances, the reduction in for- 
ward masking results in an underestimation of the 
response amplitude due to the subtraction paradigm. 
At shorter masker advances, however, the reduction 
in the response amplitude cannot be a result of insuffi- 
cient forward masking. Instead, it is probably a result 
of the neural response to the masker stimulus en- 
croaching into the measurement window. This would, 
again, result in an underestimation of the actual neural 
response amplitude. 


The results suggest that the stimulation rates tested 
have little or no effect on the measured neural re- 
sponse. All clinicians preferred the faster rate (80 
Hz maximum), as this reduced the overall test time. 
That the perceived loudness decreases with decreas- 
ing stimulation rate indicates that some temporal in- 
tegration effects are present. This finding implies that 
the stimulation rate parameter may be reduced to 
achieve a higher stimulation level without raising the 
perceived loudness of the test signal. 


The minimum number of sweeps required to obtain 
a clear neural response is expected to be subject- 
dependent, but is primarily determined by the amount 
of accompanying noise in the measurement. In in- 
stances in which the neural response amplitude is 
very large compared to the measurement noise, very 
few sweeps are needed to capture the neural response. 
However, this is rarely the case, particularly when 


one is attempting to measure lower response ampli- 
tudes, and some averaging is required. With the am- 
plifier gain set to 60 dB, it was found that 100 sweeps 
were usually sufficient to yield a clear neural response. 
At an amplifier gain of 40 dB, the minimum number 
of sweeps needs to be approximately doubled to 200 
sweeps to compensate for the reduced signal-to-noise 
ratio. 


A set of default test parameters has therefore been 
established. A stimulation rate of 80 Hz is preferred 
over 35 Hz, as little detrimental effect was found on 
the resultant response, and the overall test time was 
reduced. A test parameter series using a measurement 
amplifier gain of 60 dB, 100 sweeps, a fixed masker 
level equal to the highest acceptable presentation lev- 
el, and a masker advance of 500 us has been shown 
to provide the quickest, most reliable results in the 
majority of adults tested. 


After the above parameters have been set, some 
other test parameters must still be optimized for in- 
dividual subjects and electrodes in order to capture a 
clearly defined neural response waveform. These in- 
clude the measurement delay and, in some cases, the 
amplifier gain and stimulation rate. Further back- 
ground details that are useful for performing param- 
eter optimization with the NRT software have been 
described by Lai® and are also treated by Abbas et 
al.? 


One point noted during the course of the study 
was that in a number of cases, no neural responses 
could be obtained, even at the highest acceptable 
presentation level with the default parameters. How- 
ever, when the stimulation rate was reduced from 80 
Hz to 35 Hz, the perceived loudness of the test signals 
decreased. This reduction, in turn, allowed the stim- 
ulation level to be increased before the highest ac- 
ceptable presentation level was reached at the lower 
stimulation rate. At higher stimulation levels, the neu- 
ral response begins to appear. The implication here 
is that a certain amount of synchronicity in the neural 
firing needs to be present before the neural response 
can be recorded via NRT. It is therefore conceivable 
that in instances in which subjects have a low sub- 
jective threshold, the level of synchronous activity 
is insufficient for NRT to register the neural response. 
The mechanism behind this observation is currently 
under investigation. 


Possible reasons for the lack of a recordable re- 
sponse in a small number of cases (7 of 38 subjects) 
include test parameters that may not have been fully 
optimized, excessively high electrical impedance of 
the stimulating or recording electrodes, a presentation 
level that was not adequate to elicit a synchronous 
neural response, or other physiological factors. All 
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of the above findings need to be considered in the 
development of common protocols for future inves- 
tigations. 
CONCLUSION 
Neural response telemetry provides a unique meth- 





od to assess peripheral neural function in Nucleus 
CI24M users. The NRT measurement technique has 
been shown to be viable for recording the ECAP or 
neural response. A set of default test parameters has 
been established to provide a relatively quick method 
of measuring the ECAP in the majority of subjects. 
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INDUCTION OF MUCOUS CELL METAPLASIA BY TUMOR NECROSIS 
FACTOR ALPHA IN RAT MIDDLE EAR: THE PATHOLOGICAL BASIS 
FOR MUCIN HYPERPRODUCTION IN MUCOID OTITIS MEDIA 
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MINNEAPOLIS, MINNESOTA 


Mucoid otitis media (MOM), one of the leading causes of acquired hearing loss in children, is characterized by mucous cell 
hyperplasia in the middle ear cleft associated with mucin accumulation in the middle ear cavity. The factors that stimulate mucous 
cell metaplasia-hyperplasia and mucin hyperproduction are poorly understood. Recent studies demonstrated that tumor necrosis 
factor œ (TNF-a), present in human middle ear effusion, stimulated mucin production in vitro and up-regulated mucin gene expres- 
sion in vivo. These findings suggest that TNF-c is important in the development of mucous cell metaplasia-hyperplasia. This study 
demonstrated that inoculation of TNF-a into the middle ear cavity followed by eustachian tube obstruction stimulated mucous cell 
metaplasia-hyperplasia in the middle ear cleft, accompanied. by abundant mucin or mucin-like glycoproteins in the middle ear effu- 
sion — a phenotype of MOM in humans. This finding suggests that TNF-c plays a key role in the pathogenesis of MOM through 
induction of mucous cell metaplasia-hyperplasia and mucin production. 


KEY WORDS — mucin, mucoid otitis media, mucous cell metaplasia, Sprague-Dawley rat, tumor necrosis factor a. 


INTRODUCTION 


Mucoid otitis media (MOM) is a leading cause of 
hearing loss and impairment of language develop- 
ment in children. It is characterized by mucous cell 
metaplasia-hyperplasia in the middle ear cleft accom- 
panied by viscous effusion containing a high concen- 
tration of mucin. The impetus for MOM appears to 
be infectious. Experimental studies in specific path- 
ogen-free (SPF) Sprague-Dawley rats showed that 
inoculation of 1 dose (2.5 x 106 colony-forming units 
per ear) of Streptococcus pneumoniae into the middle 
ear cavity, followed 2 weeks later by eustachian tube 
obstruction (ETO) for 4 weeks, induced thick (mu- 
coid) fluid, whereas ETO alone did not.! Consistent 
with this finding, MOM in humans is frequently pre- 
ceded by acute middle ear infection.” This observa- 
tion suggests that primary middle ear infection may 
be requisite to the development of MOM; however, 
factors that stimulate mucous cell metaplasia-hyper- 
plasia and mucin hyperproduction in the middle ear 
after middle ear infection are poorly understood. 


Many investigators have demonstrated that bacte- 
rial metabolites such as peptidoglycan-polysaccha- 
ridé (PG-PS) and endotoxins (lipopolysaccharide 


[LPS]) are present in middle ear effusion,>-’ especial- 
ly mucoid effusion.’ Both PG-PS and LPS are known 
to be potent inducers of tumor necrosis factor a (TNF- 
a): Himi et al? demonstrated that TNF-a is expressed 
in middle ear effusions of humans with MOM. In 
vivo studies in the chinchilla have shown that inocu- 
lation of pneumococcal cell wall components and 
LPS into the middle ear cavity causes otitis media!® 
and elicits production of TNF-.!! In vitro, TNF-a 
has been shown to stimulate production and secre- 
tion of mucous glycoprotein in chinchilla middle ear 
epithelial cells!? and airway secretory epithelium. !3 
A study in our laboratory demonstrated that TNF-a 
inoculated into the middle ear cavity of Sprague- 
Dawley rats up-regulated mucin gene (Muc2) expres- 
sion in the middle ear cleft.!4 This finding led us to 
hypothesize that TNF-o induced by middle ear patho- 
gens (gram-positive or -negative) plays an impor- 
tant role in the pathogenesis of MOM. The purpose 
of the current study was to determine whether TNF- 
@ is responsible for the development of MOM. 


MATERIALS AND METHODS 


Seventy-seven SPF Sprague-Dawley rats weigh- 
ing 200 to 250 g were used in this study. The rats 
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were examined under an operating microscope be- 
fore the experiment to confirm that their middle ears 
were free of infection. 


Inoculation of TNF-aInto Middle Ear Cavity. Six- 
ty-two rats were anesthetized with ketamine hydro- 
chloride (40 mg/kg) and xylazine hydrochloride (8 
mg/kg), a ventral incision was made, and the mus- 
cles were separated to expose the ventral bullae. A 
tiny hole was made with a 27-gauge needle to equal- 
ize the pressure, and another hole was made to inoc- 
ulate TNF-c. Two hundred fifty nanograms of recom- 
binant human TNF-a (rhTNF-a, R & D Systems 
Company, Minneapolis, Minnesota) in 50 uL of phos- 
phate-buffered saline solution (PBS; 5 ng/mL) was 
inoculated into each middle ear cavity. Five nano- 
grams per milliliter or 250 ng per ear is a relatively 
high concentration of TNF-a compared to that in mid- 
dle ear effusion,’ but this dose was chosen on the ba- 
sis of a previous study.!5 The rationale for using such 
a high dose of TNF-a is that infection causes inflam- 
mation of the middle ear mucosa that may produce 
high amounts of TNF-a in situ. Fifty microliters of 
PBS alone was inoculated into both middle ear cavi- 
ties of an additional 12 rats to serve as controls. The 
holes were sealed with sterile bone wax, and the inci- 
sion was closed. Six of the rats inoculated with TNF- 
œ inoculation were painlessly sacrificed 6 weeks later 
(TNF-a alone group), and 56 underwent ETO (see 
below). Six of the rats inoculated with PBS (PBS 
alone group) were painlessly sacrificed 2 weeks af- 
ter inoculation, and the remaining 6 rats underwent 
ETO (see below). 


ETO After Inoculation of TNF-a. Two weeks af- 
ter the inoculation of TNF-a or PBS, the animals 
were anesthetized with ketamine hydrochloride (40 
mg/kg) and xylazine hydrochloride (8 mg/kg), and 
bilateral ETO was performed by electrocauterization 
by a ventral approach. Fifty-six rats inoculated with 
TNF-o were sacrificed 4 weeks after ETO (TNF-a 
+ ETO group). Six rats with PBS inoculation were 
sacrificed 4 weeks after ETO (PBS + ETO group). 
Eustachian tube obstruction was performed on an ad- 
ditional 6 uninoculated SPF rats, which were sacri- 
ficed 4 weeks later to serve as controls (ETO alone 
group). 

Morphological Study of Middle Ear Cleft. Twelve 
bullae chosen freely from the ears with thick effu- 
sion (TNF-c + ETO), 12 bullae from the PBS + ETO 
group, and 6 bullae from each of the TNF-a alone, 
PBS alone, ETO alone, and untreated SPF rats were 
fixed in 10% formaldehyde and decalcified in 10% 
ethylenediaminetetraacetic acid for light microscopic 
study. Specimens were embedded in paraffin, sec- 
tioned at a thickness of 5 um, and stained with he- 


matoxylin and eosin (H & E), as well as Alcian blue— 
periodic acid—Schiff (AB-PAS). The thickness of the 
epithelium and submucosa was measured under the 
microscope in H & E~stained sections. The distance 
between the apical surface and basement membrane 
was defined as the thickness of the epithelium, and 
the distance between the basement membrane and 
the bony surface was defined as the thickness of the 
submucosa. Epithelial cells in the superior ciliated 
tract that stained positively with AB-PAS were de- 
fined as secretory or mucous cells, and the number 
of cells per high-power field was counted on each 
section of middle ear tissue. 


Three bullae from the TNF-a + ETO group were 
fixed with 2.5% glutaraldehyde (Ted Pella, Tustin, 
California) in 0.1 mol/L phosphate buffer pH 7.4 for 
2 hours at 4°C for microscopic study. After rinsing 
with phosphate buffer, the ciliated tract of the middle 
ear epithelium was dissected out under the micro- 
scope, dehydrated in a graded series of ethanol, and 
embedded in epoxy resin. Semithin sections were 
stained with toluidine blue for determination of tis- 
sue orientation. Ultrathin sections were cut, stained 
with uranyl acetate and lead citrate, and examined 
under a JEOL 1008S electron microscope. Cells with 
light secretory granules and dark cytosol were de- 
fined as secretory or mucous cells. 


Characterization of Middle Ear Fluid. Middle ear 
fluid was collected from the bullae at the time of 
sacrifice. An aliquot of fluid was smeared on glass 
slides, air-dried, and stained immediately with tolui- 
dine blue, and the effusion type was determined on 
the basis of the presence of fibrous strands and the 
number of inflammatory cells in the routine manner. 
Fluid with fibrous strands, but devoid of inflamma- 
tory cells, was defined as mucoid (thick) effusion. 
Cloudy fluid with abundant inflammatory cells was 
defined as purulent effusion, whereas fluid with no 
inflammatory cells or fibrous strands was defined as 
serous (thin) effusion. Effusions of each type were 
pooled, supplemented with 2 mmol/L phenylmethyl- 
sulfonyl fluoride (PMSF; protease inhibitor), and 
stored at —70°C. Middle ears with cloudy fluid were 
considered to be contaminated or infected by bacte- 
ria and thus were excluded from this study. Twenty- 
four middle ear effusions (12 from the TNF + ETO 
group and the other 12 from the PBS + ETO group) 
were routinely cultured for bacteria on blood and 
chocolate agar plates. 


Purification and Quantitation of Mucins or Mu- 
cin-Like Glycoproteins. Purification and quantitation 
of mucins and mucin-like glycoproteins from the bul- 
lae of the TNF-a + ETO group, the PBS + ETO group, 
and the ETO alone group (controls) was carried out 
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as described previously,!© with minor modifications. 
Briefly, middle ear fluid from 72 ears was dissolved 
in extraction buffer containing 2 mmol/L PMSF, 4 
mol/L guanidine hydrochloride, 5 mmol/L ethylene- 
diaminetetraacetic acid (pH 8.0), 50 mmol/L Tris- 
hydrochloride (pH 7.5), and 0.02% sodium azide, ho- 
mogenized briefly, and centrifuged at 13,000 rpm at 
4°C to remove cellular debris. The solution was sup- 
plemented with cesium chloride at 1.4 g/mL and cen- 
trifuged for 72 hours at 1.6 x 10° g at 4°C with the 
brake off. Fractions with a density between 1.30 and 
1.45 g/mL, containing high concentrations of hex- 
ose, were pooled. The pooled solution was supple- 
mented with the above extraction buffer, and the pro- 
cedure was repeated 3 times. Fraction(s) with a den- 
sity of approximately 1.35 to 1.45 g/mL and a hex- 
ose-to-protein ratio of approximately 2:1 were de- 
fined as mucin or mucin-like glycoproteins. After this 
protocol, mucin or mucin-like glycoproteins were 
completely dissociated from other proteins in the 
middle ear effusions and were highly pure. 


RESULTS 


Mucous Cell Metaplasia-Hyperplasia. The H & E- 
stained sections from the middle ear clefts of rats in 
the TNF-œ + ETO group revealed pseudostratified 
or columnar epithelia with strikingly thickened sub- 
mucosa. Light microscopy revealed a significant in- 
crease in the number of mucous cells (Figs 1A and 
2A), compared to the PBS alone, ETO alone, TNF- 
a alone, and PBS + ETO groups (Figs | B-E and 2B- 
E). Mucous cell counts for the various groups are 
presented in the Table. The mucosa and submucosa 
of the TNF-a. + ETO group was significantly thicker 
than that of controls (see Table). Under electron mi- 
croscopy, the epithelia showed abundant secretory 
cells that were lean and shrunken, probably because 
of depletion of secretory granules. Most of the secre- 
tory granules were light and without the characteristic 
dark core of mucin granules. In some mucous cells, 
the apical cytoplasmic portion containing light gran- 
ules had protruded into the middle ear cavity (Fig 3) 
— a morphological sign of degranulation or mucus 
secretion. 


Gross Examination and Bacterial Culture of Mid- 
dle Ear Fluid. In the TNF-a + ETO group, 49 ears 
had thick effusions that contained mucous strands 
on smear (Fig 4A), 20 ears had thin effusions (Fig 
4B), 11 ears had purulent effusions (Fig 4C), and 32 
ears had no effusion. In the PBS + ETO group, 11 
ears had thin, transparent effusions and | ear had pu- 
rulent effusion. In the group that underwent ETO 
alone, 5 ears had thin, transparent effusions with no 
mucous strands (thin effusion), and 1 ear had no ef- 
fusion. None of the rats in the PBS alone or TNF-a 





Fig 1. Alcian blue—periodic acid—Schiff (AB-PAS) stain- 
ing. Mucous cell metaplasia-hyperplasia in middle ear 
cleft of animals treated with tumor necrosis factor œŒ 
(TNF-a) and eustachian tube obstruction (ETO) is shown 
by light microscopy. There was increase in AB-PAS—posi- 
tive cells (mucous cells) in A) TNF-a + ETO and D) 
TNF-a groups, as compared to B) phosphate-buffered 
saline solution (PBS), C) ETO, and E) PBS + ETO 
groups. 
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PBS+ETO 





Fig 2. Hematoxylin and eosin staining (light microscopy). 
Middle ear mucosa was thickened in A) TNF-c + ETO 
and D) TNF-o groups, as compared to B) PBS, C) ETO, 
and E) PBS + ETO groups. Arrow indicates position of 
basement membrane. 


alone groups had effusion at the time of sacrifice. 
Positive bacterial growth was found in 3 specimens: 
1 from the PBS + ETO group and 2 from the TNF-o 
+ ETO group. 


Quantitation of Secreted Mucins From TNF-a + 








MORPHOLOGICAL CHANGES IN MIDDLE EAR CLEFT 
WITH AND WITHOUT TNF-a TREATMENT 











No. of 
Secretory Mucosal Submucosal 

Cells (Cells Thickness Thickness 
Treatment Group per HPF) (um) (um) 
TNF-a + ETO 32.7£7.8* 24.5+6.1* 76.3 +16.3* 
PBS + ETO 24.74 7.6 17.4474 41.7+11.6 
TNF-a 24.3£9.9¢ 13.0+5.1¢ 34.2+10.1+ 
PBS 16.1+7.8 OZER? 17.2: 3.7 
ETO 6.2+3.4 7.04 1.1 2VTERS 
SPF rats 15.6 + 3.2 TIEZ 10.5 £1.1 


TNF-a — tumor necrosis factor œ, HPF — high-power field, ETO 
— eustachian tube obstruction, PBS — phosphate-buffered saline 
solution, SPF — specific pathogen-free. 

*Student’s t-test: p < .05 compared to controls (PBS + ETO, PBS, 
ETO, and SPF rats). 

#Student’s t-test: p < .05 compared to controls (PBS, ETO, and SPF 
rats). 





ETO Group. In the TNF-a + ETO group, middle ear 
effusions contained abundant mucin or mucin-like 
glycoproteins (Fig 5) that had a high density and a 
high ratio of hexose to protein. Fraction 7 demon- 
strated a characteristic mucin or mucin-like glycopro- 
tein with a density of 1.41 g/mL and a ratio of hex- 
Ose to protein of approximately 1.5 to 1. This find- 
ing is characteristic of mucin or mucin-like glyco- 
proteins, which have such a high density and, at the 
same time, a high ratio of hexose to protein. No other 
proteins or glycoproteins in middle ear effusions have 
these features. In the ETO alone group and the PBS 
+ETO group, the middle ear effusions were thin, and 
no mucin or mucin-like glycoproteins were purified 
by this protocol because of the limited volume of the 
middle ear effusion from the two groups and the loss 
of mucin or mucin-like glycoproteins during the pro- 
cedure. 


DISCUSSION 


Mucous cell metaplasia-hyperplasia in the middle 
ear cleft is a cardinal pathological correlate of otitis 
media, especially MOM. It is the histopathologic ba- 
sis for mucin or mucin-like glycoprotein production 
in the middle ear cavity, which contributes to the ac- 
cumulation of intractable middle ear fluid and the de- 
structive process of otitis media, and, ultimately, com- 
promises middle ear function. The factors that modu- 
late mucous cell metaplasia or hyperplasia and mu- 
cous secretion in the middle ear cavity are poorly un- 
derstood. The evidence accumulated to date!-4.7.9.11.17 
suggests that initial infection in the middle ear cleft 
is necessary for the development of MOM. Tumor 
necrosis factor & plays an important role in inflam- 
matory and immune reactions, '*!9 and over-expres- 
sion of TNF-o has been implicated in life-threaten- 
ing states such as shock and wasting.!8-!9 Increased 
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Fig 3. Morphological features of mucous cells and their 
secretory granules under electron microscopy. Mucous 
cells (nonciliated cells; NC) were abundant in TNF-a + 
ETO group, and their appearance was lean and shrunken 
(arrows in A). Granules in mucous cells were light, with- 
out dark cores. Apical portion of cytoplasm of secretory 
cell contains light granules protruding into middle ear 
space (arrow in B), which is sign of degranulation. Mu- 
cous cell appeared to be depleted of granules. C — cili- 
ated cell. 


expression of TNF-a. has been observed in the middle 
ear effusions of humans with MOM? and in experi- 
mental animal models.!! The exact role of TNF-o in 
the middle ear, however, is not currently understood. 
Our recent studies demonstrated that TNF-o stimu- 
lated mucous glycoprotein secretion from chinchilla 
middle ear epithelial cells in vitro!* and mucin gene 
expression in vivo,!*-!5 suggestive of TNF-a involve- 
ment in mucin metabolism. It appears that TNF-a 
plays a key role in the pathogenesis of mucous cell 
metaplasia-hyperplasia and in the development of 





Fig 4. Characteristic features of middle ear effusion 
smears. A) Thick effusion from TNF-a + ETO group con- 
tained fibrous strands with few inflammatory cells. B) 
Thin effusion from TNF-a + ETO group had no fibrous 
strands or inflammatory cells. C) Cloudy effusion from 
TNF-a + ETO group is full of inflammatory cells. 


thick fluid in the middle ear. 


In the current study, we demonstrated that TNF-a 
induced development of thick or mucoid effusion in 
the middle ear cavity of SPF rats at a rate of 43%. 
The thick effusion from this model displayed micro- 
scopic “mucous strands” that are characteristic of 
mucins. Biochemical analysis of the middle ear ef- 
fusions confirmed that middle ear effusion from ani- 
mals in the TNF-a + ETO group contained abun- 
dant mucin or mucin-like glycoproteins. 


Mucin is a glycoprotein consisting of a heavy car- 
bohydrate coat and a lengthy polypeptide backbone 
that is highly glycosylated through O-glycosylation 
on threonine and/or serine residues. The molecular 
makeup is composed of as much as 75% carbohy- 
drates and as little as 25% protein in dry weight. The 
mucin molecule itself is very large: approximately 
12 to 16 kb nucleotides or 4,000 to 5,300 amino acid 
residues. Because of its heavy carbohydrate coat, mu- 
cin has a very high (200 to 600 kd) molecular weight, 
a high density in cesium chloride solution (about 1.4 
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Fig 5. Identification and quantitation of mucin or mu- 
cin-like glycoproteins in middle ear effusions of TNF-a 
+ ETO group. One hexose peak (arrow, fraction 7) was 
identified after 3 spins of effusion in cesium chloride 
solution in which hexose concentration was higher than 
protein concentration. Fraction 7 (arrow) had high den- 
sity of 1.41 g/mL, as well as ratio of hexose to protein of 
approximately 1.5 to 1, (Total mucin or mucin-like gly- 
coproteins were approximately 1,220 ug in 72 ears, or 
16.9 ug per ear.) 


g/mL), and a high viscosity (a property of carbohy- 
drate moiety). Mucin monomers have been shown 
to assemble into a polymer in the rough endoplas- 
mic reticulum,2° and package themselves into gran- 
ules before secretion. Polymerization of mucin dou- 
bles or triples its molecular weight, making it cap- 
able of gel formation. FitzGerald et al?! demonstrated 
a gel property of middle ear mucin glycoprotein when 
concentrated. Polymerized mucins are identified mi- 
croscopically by the characteristic mucoid effusion 
or fibrous strands and by their gel appearance under 
the naked eye. Hutton et al?? demonstrated that the 
thiol residues of mucoid effusion were much higher 
than those of serous effusion, and that loss of the 
disulfide bond between mucin proteins led to loss of 
viscosity,2> suggestive that the disulfide bridge (mu- 
cin polymerization) is an important physical charac- 
teristic of mucin. Whether mucins exist in serous ef- 
fusion, or whether the characteristic “serous” appear- 
ance represents monomers of mucin, is not known at 
the present time. It is possible that there are mucins 
in serous otitis media that differ in nature from the 
mucins in MOM only by their mucin concentration. 
This question needs to be studied in the future with 
animal models and molecular cloning techniques. 


It should be noted that not every rat in the TNF-a 
+ ETO group of this protocol developed thick effu- 
sion. The reason for this finding is unclear, but it 
may be a result of factors such as individual suscep- 
tibility to TNF-a challenge in the middle ear cleft 
caused by genetic differences, failure of ETO, or re- 
opening of the eustachian tube after obstruction. 
Therefore, some of the animals may have developed 


serous effusion because of an inherent resistance to 
TNF-a challenge, developed purulent effusion be- 
cause of an invasion of bacterium into their middle 
ear epithelial cells during the protocol, or developed 
no effusion at all because of inefficacious ETO. 


Morphological study of secretory cells from the 
middle ear cleft of rats in the TNF-a + ETO group 
demonstrated a phenotype similar to that of MOM 
in humans: metaplasia-hyperplasia of mucous cells. 
Most of the granules in these mucous cells appeared 
to be light, with no dark cores, under electron mi- 
croscopy, and were thought to contain mucin or mu- 
cin-like glycoproteins. Jany et al?4 described mucins 
of the respiratory tract as having light granules with 
no dark cores. Lim et al?5 inferred that light granules 
with no dark cores represented mature mucin glyco- 
protein. The identity of these mucins, however, is un- 
known, and further study with molecular biology tech- 
niques is warranted. 


Histochemical analysis of the mucins induced by 
TNF-o inoculation followed by ETO revealed neu- 
tral and acidic glycoproteins that stained violet or pur- 
ple in color with AB-PAS. We, and others, have dem- 
onstrated biochemically2!26 and histochemically27 
that middle ear mucins (glycoconjugates) are rich in 
sialic acids. Mucins (glycoconjugates) identified in 
human temporal bones with MOM have been char- 
acterized as being high in disaccharide residues such 
as NeuNAca(2-3)Gal and NeuNAcB(2-6)GalNAc, 
which contain sialic acids in their terminal ends.27 
Acidic mucins are important physiologically, and 
may play a role in protection of the middle ear epi- 
thelium because of their negative charges. The theory 
is that surface proteins of pathogens are negatively 
charged and therefore unable to bind to the negatively 
charged proteins of the middle ear epithelium. Evi- 
dence supporting this theory includes a study of S$ 
pneumoniae—induced otitis media in chinchillas, 
which found that sialic acids, considered to be im- 
portant for adherence and colonization of middle ear 
pathogens in middle ear epithelia, were absent from 
the infected middle ear.?8 


Proliferation of fibroblasts in the submucosa has 
been identified clinically in humans with MOM and/ 
or chronic otitis media. Interestingly, inoculation of 
TNF-a caused a substantial thickening of.the sub- 
mucosa or outgrowth of fibroblasts in this study, sug- 
gestive that TNF-a may play a role in formation of 
middle ear granulation tissue and subsequent fibro- 
sis, leading to intractable sequelae in the middle ear 
cleft. 


It is known that regular Sprague-Dawley rats (non- 
SPF) and other species develop thick effusion after 
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ETO, because of the presence of pathogens (bacte- 
ria and/or viruses) in the middle ear cavity. The SPF 
rats in the ETO group in this study did not develop 
thick effusion. Goldie et al! found that SPF rats did 
not develop thick effusion unless there was prior chal- 
lenge to the middle ear cleft. Our control animals — 
ie, PBS + ETO, TNF-o alone, PBS alone, and ETO 
alone groups — did not develop thick effusion in the 
middle ear cavity. The animals in the PBS + ETO 
group developed thin effusion and had minor degrees 
of mucous cell metaplasia or hyperplasia in the mid- 
dle ear cleft, whereas those inoculated with TNF-a 
alone developed mucous cell metaplasia-hyperpla- 
sia, but no effusion in the middle ear cavity. Inocula- 
tion with PBS caused visible but minor changes in 
the middle ear cleft, as a result of trauma to the bul- 
lae and the effect of PBS on the mucosa. Eustachian 
tube obstruction alone produced serous effusion in 
the middle ear cavity in this study and in others.! 
Our previous study demonstrated that inoculation of 
TNF-a into the rat middle ear cavity caused mucin 
gene (Muc2) up-regulation in the middle ear cleft,!415 
suggestive that TNF-c may play a role in the devel- 
opment of MOM. These findings, when viewed to- 
gether, suggest that initial challenge to the middle 
ear cleft is essential for the development of MOM, 





and up-regulation of mucin gene expression in the 
middle ear mucosa is key to the pathogenesis of 
MOM in the rat model. The ETO appeared to con- 
tribute to the development of MOM in this study by 
triggering accumulation of mucins in the middle ear 
cavity. The role of ETO in the development of MOM 
in the TNF-o-primed middle ear cleft requires fur- 
ther investigation. 


CONCLUSION 


Inoculation of TNF-a into the middle ear cavity, 
followed by ETO, stimulated accumulation of abun- 
dant mucins or mucin-like glycoproteins in the mid- 
dle ear of SPF Sprague-Dawley rats. Forty percent 
of the ears had thick effusion in the middle ear cav- 
ity, accompanied by mucous cell metaplasia-hyper- 
plasia, a phenotype of MOM in humans. None of the 
controlrats thatreceived PBS inoculation, ETO alone, 
or PBS + ETO developed thick effusion in the middle 
ear cavity, whereas inoculation of TNF-a alone 
caused mucous cell metaplasia-hyperplasia in the 
middle ear cleft. It was concluded that TNF-a plays 
a key role in the pathogenesis of mucous cell meta- 
plasia-hyperplasia, which is the pathological basis 
for mucus hyperproduction in middle ears with MOM. 
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QUININE AFFECTS THE RESPONSE PROPERTIES OF COMPOUND 
ACTION POTENTIALS ELICITED BY PERIODIC CLICK TRAINS 
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The effects of systemically applied quinine on the compound action potential (CAP) were investigated in 5 guinea pigs. A dose 
of 200 mg/kg body weight of quinine hydrochloride was administered intramuscularly, and CAPs were recorded at the round window 
before and after administration. The CAP thresholds of the animals were elevated by 5 to 25 dB approximately 30 minutes after 
administration, and thresholds recovered in some animals during the experimental session. The CAP waveform elicited by the click 
train stimulus was abnormal after administration of quinine. The amplitude of the CAPs elicited by the second click was larger than 
that of those elicited by the first click. These changes may be induced by an abnormally broadened N1 response to the first click in the 
click train following quinine administration. In contrast, the CAP waveforms elicited by the second click and by the following clicks 


in the click train appeared normal. 


KEY WORDS — compound action potential, guinea pig, quinine. 


INTRODUCTION 


Quinine has long been used as an effective antima- 
larial agent. It has become increasingly common for 
clinicians to use this agent for the treatment of noc- 
turnal cramps in the leg muscles. Quinine affects al- 
most every tissue in the body and has been called a 
“protoplasmic poison” when present in high concen- 
tration.! Its side effects commonly include nausea, 
vomiting, and diarrhea. The chronic intake of quinine 
in high doses will produce a condition called cincho- 
nism, which produces symptoms from many organ 
systems (auditory, visual, central nervous, gastroin- 
testinal, respiratory, and circulatory systems), with 
death as the ultimate end point.? Although ototoxic- 
ity is widely known as a potential side effect of qui- 
nine,!-!! the underlying mechanisms are still obscure. 


In humans, the hearing loss caused by quinine is 
closely related to its plasma concentration.” Physio- 
logical tuning curves become shallower, whereas 
stapedial reflex thresholds are unaffected.3 The shift 
of the otoacoustic emission threshold parallels that 
of the pure tone threshold.* Quinine intoxication also 
induces quantitative changes in the otoacoustic emis- 
sions from subjects with hearing impairment.’ Qui- 
nine sulfate is capable of reducing or eliminating 
spontaneous, distortion product, and transient evoked 
otoacoustic emissions.‘ Presumably, these otoacous- 
tic emission abnormalities are the results of outer hair 
cell dysfunction. In addition to disorders of cochlear 


origin, ingestion of a daily 105-mg dose of quinine 
has been reported to induce transient abnormalities 
in positional electronystagmography.° 


Quinine administration in guinea pigs caused a 
comparable threshold elevation in the compound ac- 
tion potential (CAP) input-output function at high 
and low stimulus levels.® The increase in CAP thresh- 
old was closely related to the quinine concentration? 
similar to the effect seen in humans. Jastreboff et al” 
reported apparent tinnitus in a rat behavioral study. 
In addition, they showed the preventing effect of ni- 
modipine, an L-type calcium ion (Ca?+) channel 
blocker, on tinnitus induced by quinine. Histologic 
examination revealed shortening of outer hair cells® 
and changes in strial vessels.? Regarding more central 
aspects of the auditory system, Mulheran!® observed 
significant changes in the distribution of spontaneous 
rates in auditory nerve fibers after the application of 
quinine. The percentage of fibers with a low sponta- 
neous firing rate was found to be about 26% in the 
control group and 47% in the quinine-administered 
group. We have previously reported significant 
changes in both spontaneous firing and temporal cod- 
ing properties in cat cortical cells upon administration 
of quinine hydrochloride at a dose of 100 to 200 mg/ 
kg body weight.!! We suggested that changes in Ca?* 
channel conductance and free intracellular Ca?* con- 
centration result in abnormal synchrony in cortical 
cells. Abnormal neural synchrony was postulated pre- 
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C ims 


Fig 1. Response waveform and measurement methods. 
First 3 ms after onset of click train stimulus is plotted. 
Arrow indicates onset of click train. Value of interclick 
interval (At) was 32 ms, and stimulus intensity was 100 
dB sound pressure level (SPL). Each circle depicts 1 
sampling point. Compound action potential (CAP) am- 
plitude is measured from onset of CAP (minimum point, 
A) to NI peak (maximum point, C). Dotted line indicates 
half-peak. Half-peak width is measured as time between 
first point (B) and last point (D) under half-peak in CAP 
waveform. Half-peak width in this waveform is 0.2 ms. 


viously as a potential correlate of tinnitus.!215 


As mentioned above, the effects of quinine on CAP 
threshold and on more central aspects of auditory 
function have been studied, but the effects on CAP 
adaptation remain obscure. It has been reported that 
CAP adaptation occurs at the level of the synapses 
between hair cells and auditory neurons.!4 Because 
quinine is known to alter ion channel activity, the 
CAP waveform elicited by click train stimuli is as- 
sumed to be altered by quinine, because ion channel 
activity is closely related to synapse function. In this 
paper we examine the effect of quinine on the CAP 
adaptation process. 


MATERIALS AND METHODS 


Animal Preparation, Adult male Hartley guinea 
pigs received an intraperitoneal injection of sodium 
pentobarbital (32.5 to 65 mg/mL) at a dose of 25 
mg/kg body weight. After 15 minutes, the animals 
were given a tracheotomy for artificial respiration. 
After head shaving, Xylocaine (a mixture of lidocaine 
hydrochloride and epinephrine, 10 mg/mL) was in- 
jected subcutaneously, and then a retroauricular skin 
incision was made to expose the round window. The 
left jugular vein was cannulated to compensate for 
the serum volume loss. The guinea pig temperature 
was maintained at 37°C + 1°C with a thermostatically 
controlled unit (BWT-100, Bio Research Ltd, Na- 
goya, Japan). Quinine hydrochloride at a dose of 200 
mg/kg body weight (100 mg/mL) was slowly admin- 
istered intramuscularly. At the end of the experiment, 
the animals were painlessly sacrificed with an over- 
dose of sodium pentobarbital. The care and use of 
the animals reported on in this study was approved 
by the Animal Care and Use Committee of St Mari- 


anna University. 


Acoustic Stimulus Presentation. Sound stimuli 
were presented by a loudspeaker (JBL-2450H) placed 
40 cm from the tested ear. Calibration and monitoring 
of the sound pressure and tonal waveforms was per- 
formed with a condenser microphone (NA-41, Rion 
Ltd, Tokyo, Japan) above the animal’s head. The CAP 
audiograms were determined for short tone bursts of 
2, 4, 8, and 16 kHz. The click trains lasted 64 to 96 
ms, and the intertrain interval was 500 ms. Both stim- 
uli were generated by a masking generator (DPS- 
727, Dia Medical System Co, Ltd, Tokyo) and a stim- 
ulator (DPS-1100D, Dia Medical System Co, Ltd). 
The interclick intervals (At) were 8, 16, 32, and 64 
ms. The number of clicks varied from 2 to 10, de- 
pending on the At because of the limit of the recording 
window: 10 clicks were included when the At was 8 
ms, 7 clicks for 16 ms, 4 clicks for 32 ms, and 2 
clicks for 64 ms. Rectangular electric stimuli of 0.1- 
ms duration were delivered to the transducer to pro- 
duce the acoustic clicks. 


Recording. A silver ball electrode was placed on 
the round window membrane. Electrical responses 
were amplified 10,000 times by a biophysical ampli- 
fier (AVB-11, Nihon Kohden Ltd, Tokyo), bandpass- 
filtered at a frequency from 0.08 Hz to 10 kHz, and 
then fed into a Macintosh computer to be averaged 
by a MacLab system (AD Instruments Pty Ltd, Syd- 
ney, Australia). The activity was sampled at 20 kHz 
for 100 ms after stimulus onset. The recording se- 
quence was composed of 2 parts, using as stimuli short 
tone bursts and click trains with different values of 
the At. After collecting preadministration data as a 
control, we performed the same experimental proce- 
dures for up to 7 hours after administration, without 
changing the electrode positions. 


Data Analysis. The CAP thresholds were deter- 
mined for each short tone burst stimulus. The ampli- 
tude and half-peak width of the response to the click 
train stimulus were analyzed, and the product of these 
2 values represented the peak area of the CAP wave- 
form. Figure 1 illustrates the method used to evalu- 
ate the CAP amplitude and half-peak width. The am- 
plitude was measured from the onset of the CAP to 
the peak of the CAP. The half-peak width was mea- 
sured at a level 50% down from the peak of the CAP. 
Guinea pig 36 was used only to record a waveform 
elicited by the click train stimulus; therefore, the 
threshold was measured in 4 guinea pigs. The statis- 
tical tests used on the data obtained before and after 
quinine administration were mostly based on the Wil- 
coxon signed rank test. Comparisons with the control 
state were made 10 to 75 minutes (43.6 + 14.9 min- 
utes, mean + SD) after administration. Changes at the 
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Fig 2. Relationships between stimulus intensity and each parameter of CAP waveforms. CAP waveforms elicited by first click 
in click train stimulus (At = 64 ms) were used for this analysis. A) Change in CAP amplitude. B) Change in CAP half-peak 
width. C) Change in product of these 2 values, representing peak area of CAP waveform. Means + 1 SE (vertical bar; n = 5) 
are indicated. Six different stimulus intensities were used for this analysis. 


p < .05 level were considered significant. All tests 
were performed with the StatView 5.0J package for 
Windows. 
RESULTS 

Thresholds. The click thresholds in the 4 guinea 
pigs were elevated by 5 to 25 dB approximately 30 
minutes after drug administration, and 2 of the ani- 
mals showed recovery during the recording session. 
The means (+SD) of the threshold elevation to tone 
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Fig 3. Representative response waveforms. Upper 4 traces 
illustrate control waveforms (pre-quinine), and lower 4 
traces reveal experimental waveforms (post-quinine). 
Value of At was 32 ms. Four stimulus intensities from 70 
to 100 dB SPL are indicated on right. 


burst stimulation were as follows: 2 kHz, 10.0+4.1 
dB; 4 kHz, 13.8 +2.5 dB; 8 kHz, 18.8 + 4.8 dB; and 
16 kHz, 23.8 + 2.5 dB. Clearly, the mean threshold 
elevations increased with increasing tone burst fre- 
quency. No statistical test was performed to deter- 
mine the level of significance between frequencies, 
because of the small number of animals (n = 4). 


Relationship Between Stimulus Intensity and CAP 
Waveform. The CAP waveforms elicited by the first 
click in the click train stimulus (At = 64 ms) were 
used for this analysis. The CAP amplitude decreased 
and the half-peak width increased in all animals after 
quinine administration (Fig 2A,B). The product of 
these 2 values, representing the peak area of the CAP 
waveform, decreased after quinine administration 
(Fig 2C). These differences in CAP amplitude, half- 
peak width, and peak area between the preadmini- 
stration and postadministration conditions were all 
statistically significant (p < .05) at all 6 intensity lev- 
els. Taking the threshold elevation into account, we 
compared the CAP waveforms at adjusted intensities 
(eg, if the threshold was elevated by 10 dB, the CAP 
waveforms seen after administration were evaluated 
at an intensity of 100 dB when compared with those 
seen before administration at 90 dB). Even after this 
adjustment, there was still a statistically significant 
(p < .05) difference, both in amplitude and in half- 
peak width, between the preadministration and post- 
administration conditions. In contrast, the peak area 
after administration of quinine did not differ statis- 
tically compared with that before administration. 


CAP Adaptation Process. Figure 3 depicts repre- 
sentative CAP waveforms elicited by the click train 
stimulus. before and after quinine administration. The 
value of the At was 32 ms. The CAP adaptation pro- 
cess was analyzed by use of normalized values for 
each parameter relative to those to the last click. The 





426 


4 





normalized amplitude 








1 2 3 4 5 6 7 
A click number 





normalized width 





1 2 3 4 5 6 7 
B click number 


changes in the normalized values of each parameter, 
including CAP amplitude, half-peak width, and peak 
area as a function of click number in the click train, 
are shown in Figs 4A, 4B, and 4C, respectively. The 
value of the At was 16 ms, and 4 different intensities 
were used for this analysis. Before quinine admin- 
istration, the normalized CAP amplitude and peak 
area gradually decreased with the click number, and 
reached the steady state around the sixth click (Fig 
4A and Fig 4C, respectively), whereas the normalized 
half-peak width gradually increased until it was satu- 
rated at about the sixth click (Fig 4B). In contrast, 
after quinine administration, the CAP amplitude elic- 
ited by the second click increased markedly and then 
_ gradually decreased (Fig 4A). This abnormal response 
pattern was also observed in the half-peak width after 
quinine administration. The width decreased in re- 
sponse to the second click in the train stimulus and 
then increased afterward (Fig 4B). In contrast, 
changes in the peak area after quinine administration 
were somewhat obscure (Fig 4C). The patterns were 
different, depending on stimulus intensity. The 
changes seen at other values of the At showed almost 
the same patterns as those in Fig 4. 


. Relationship Between Response Elicited by First 
Click and That Elicited by Second Click. As reported 
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Fig 4. Changes in CAP adaptation process. Nor- 
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above, the CAP elicited by the first click was abnor- 
mally broad, but an almost normal CAP was elicited 
by the second click (Figs 2-4). Figure 5 shows the 
relationship between stimulus intensity and each pa- 
rameter of the CAP waveform elicited by the second 
click. The CAP amplitude and peak area decreased 
after quinine administration (Fig 5A,C). These differ- 
ences in CAP amplitude and peak area between the 
preadministration and postadministration conditions 
were Statistically significant (p < .05) at stimulus in- 
tensities of 70 dB and 90 dB. However, the statistical 
results were different depending on the value of the 
At at stimulus intensities of 80 dB and 100 dB. In 
contrast, the CAP width after administration of qui- 
nine did not differ statistically from that seen before 
administration at any intensity levels (Fig 5B). 


Figure 6 reveals the ratio of each parameter of the 
CAP elicited by the second click to that of the CAP 
elicited by the first click, as a function of At. Before 
quinine administration, the ratio for the CAP ampli- 
tude gradually increased with increases of At, where- 
as the ratio for the width gradually decreased (Fig 
6A,B). After quinine administration, the ratio for the 
CAP amplitude increased and then decreased. The 
highest value of the ratio for the CAP amplitude oc- 
curred 16 to 32 ms after the first click (Fig 6A), This 
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abnormal pattern was also observed in the CAP width 
after quinine administration. The width decreased and 
then increased afterward. The lowest value of the 
ratio for the width also occurred during this time pe- 
riod (Fig 6B). In contrast, the relationships between 
the ratio for the CAP peak area and At were somewhat 
obscure both before and after quinine administration 
(Fig 6C). 


DISCUSSION 


We observed threshold elevations of 5 to 25 dB in 
all animals approximately 30 minutes after quinine 
administration. In human studies, the maximum oto- 
toxic effect of quinine occurs 3 to 4 hours after in- 
gestion of quinine at a dose of 5 to 15 mg/kg body 
weight, and the hearing returns to normal within 30 
hours.? The hearing loss was flat for frequencies 
higher than 250 Hz, but the effect of the drug was 
less marked at 125 and 250 Hz.3 In an animal elec- 
trophysiological study, quinine caused threshold ele- 
vation of the CAPs.® In the central auditory system, 
we previously observed threshold elevations of 10 
to 40 dB 30 minutes after administration of quinine 
hydrochloride at a dose of 100 to 200 mg/kg body 
weight, and some animals showed a recovery of 
threshold during the experimental session.!! Further- 
more, we also observed changes in the tuning prop- 
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erties of cortical cells, with the tip of the tuning curve 
shifting toward the lower frequencies. This observa- 
tion suggests that the threshold elevation at the higher 
frequencies is somewhat larger than that at the lower 
frequencies. This finding is compatible with our CAP 
results in the present study. 


The CAP input-output function exhibited a reduc- 
tion of amplitude at both low and high intensities. 
This outcome is consistent with those of previous 
reports. The CAP half-peak width increased, while 
the peak area decreased, after quinine administra- 
tion. Changes in both CAP amplitude and CAP half- 
peak width were observed even after adjusting the 
threshold elevation. However, the changes in CAP 
peak area disappeared after adjustment, suggesting 
that the number of nerve fibers contributing to the 
CAPs were not altered after quinine administration. 
These findings suggest that the main cause of the 
abnormal broadening of the response elicited by the 
first click was a decrease in the synchronization of 
individual single-fiber action potentials (APs), rather 
than a broadening of the individual single-fiber APs. 
Since synchronization of APs may be strongly de- 
pendent on the function of the L-type Ca?+ channel 
at the base of the inner hair cells,!> changes in this 
channel are a likely cause of this abnormal broaden- 
ing. We have previously suggested that an increase 
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in the intracellular availability of Ca?+ might induce 
tinnitus.!! Eleno et al!5 observed increased 45Ca up- 
take by quinine in isolated mast cells, and suggested 
that quinine opens certain Ca?*+ channels and leads 
to elevated intracellular Ca?* levels. 


In addition to changes in Ca?+ channel conduc- 
tance, it has been suggested that the inactivation prop- 
erties of the potassium ion (K*) current provide a 
direct explanation for the cumulative broadening of 
APs,!5.16 Cumulative K+ current inactivation is sup- 
posed to result in a gradual broadening of CAPs dur- 
ing repetitive stimulation. Quinine is known to be a 
K+ channel blocker, and is reported to block the Ca?*- 
activated K* currents in red blood cells,!” pancreatic 
cells,!8 and GH3 cells.!9 It is not impossible that qui- 
nine affects the Ca? channel indirectly by interfering 
with K+ channels in the hair cells. Calcium and K+ 
channels are closely related to each other, and it might 
sometimes be difficult to specify which is responsible 
for an outcome. 


The overall time course of the CAP amplitude re- 
duction to trains of clicks before quinine administra- 
tion was similar to that reported previously. 1420 After 
administration of quinine, an abnormal response to 
the click train stimulus was observed. The broadening 
of the N1 response to the first click, rather than 





Fig 6. Ratios of each parameter of CAP wave- 
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changes in the CAP adaptation process, appeared to 
be responsible for this abnormal response. The am- 
plitude of the response elicited by the first click de- 
creased, and its half-peak width increased, after qui- 
nine administration. However, after the response 
elicited by the second click, the adaptation process, 
including both reduction of amplitude and broaden- 
ing of CAPs, returned to the preadministration pat- 
tern. This finding suggests that the changes in Ca?+ 
and/or K* channels were either reset or recovered to 
the normal state after the second click in the train. It 
thereby suggests that the first stimulus mobilizes the 
Ca?* entry into the hair cell so as to produce normal 
synchronization for the CAPs elicited by the second 
click. One expects that this effect would be in compe- 
tition with the depression caused by the previous 
stimulus. By evaluating the relationship between the 
responses elicited by the first click and those elicited 
by the second click, we found that the critical duration 
for this competition occurred 16 to 32 ms after the 
presentation of the first click. This normal or super- 
normal response to the second click was observed 
even:at the largest value of At (64 ms). It is therefore 


regrettable that At values larger than 64 ms were not _ 


used in the present study; a future study using larger 
At values would clarify how long this abnormal phe- 
nomenon continues during click train stimulation. 
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ANAEROBIC BACTERIA IN UPPER RESPIRATORY TRACT AND OTHER 
HEAD AND NECK INFECTIONS 


ITZHAK BROOK, MD, MSc 
WASHINGTON, DC 


Anaerobic bacteria are common in chronic upper respiratory tract and head and neck infections. Anaerobes are the most pre- 
dominant components of the normal human oropharyngeal bacterial flora, and are therefore a common cause of bacterial infections 
of the upper respiratory tract that are of endogenous origin. Because of their fastidious nature, anaerobes are difficult to isolate from 
infectious sites and are often overlooked. Anaerobic bacteria can be recovered in chronic otitis media and sinusitis, and play a role in 
tonsillitis. They are also important in complications of these infections. Anaerobes predominate in deep oral and neck infections and 
abscesses. In addition to their direct pathogenicity in these infections, they possess an indirect role through their ability to produce 
the enzyme B-lactamase. In this fashion, they are capable of “shielding” non—B-lactamase—producing bacteria from penicillins. The 
lack of directing adequate therapy against these organisms may lead to clinical failures. Their isolation requires appropriate methods 
of collection, transportation, and cultivation of specimens. Treatment of anaerobic infections is complicated by the slow growth of 
these organisms, by their polymicrobial nature, and by the growing resistance of anaerobic bacteria to antimicrobials, Antimicrobial 
therapy is often the only form of therapy required, whereas in other cases, it is an important adjunct to a surgical approach. Because 
anaerobic bacteria generally are recovered mixed with aerobic organisms, the choice of appropriate antimicrobial agents should 
provide for adequate coverage of both types of pathogens. 


KEY WORDS — anaerobic bacteria, otitis, sinusitis, tonsillitis, upper respiratory tract infection. 


INTRODUCTION Sutterella), gram-positive cocci (primarily Pepto- 
Anaerobic bacteria are common in upper respira- Streptococcus spp), gram-positive, spore-forming ba- 
tory tract infections (URTIs) and head and neck in- cilli (Clostridium spp), non-spore-forming bacilli 
fections of a chronic nature, in which they may be (Actinomyces, Propionibacterium, Eubacterium, 
serious and life-threatening. The importance of anaer- Lactobacillus, and Bifidobacterium), and gram-nega- 
obes in these infections is due to their predominance tive cocci (mainly Veillonella).>-° All of these organ- 
in the normal human oropharyngeal mucous mem- isms, with the exception of Clostridium spp, can be 
brane bacterial flora.!:2 These organisms are there- recovered from URTIs and head and neck infections. 
fore a common cause of bacterial infections of endog- Mixed infections caused by numerous aerobic and 
enous origin. Because of their fastidious nature, they anaerobic organisms are observed commonly in clini- 
are difficult to isolate and are often overlooked. Their cal situations.>4 


exact frequency is difficult to ascertain from the med- 
ical literature because of the inconsistent use of ade- 
quate methods for their isolation and identification. 
Their isolation requires appropriate methods of col- 


The taxonomy of anaerobic bacteria has recently 
changed because of their improved characterization 
by genetic studies.>:’ The ability to differentiate be- 
tween similar strains enables better characterization 


: capa > of types of infection and predicted antimicrobial sus- 
mens.3> Treatment of these infections is complicated 


: : ceptibility. 
by the slow growth of these organisms, by their poly- j , , 
microbial nature, and by the growing resistance.of Pigmented Prevotella (Prevotella melaninogenica 
anaerobic bacteria to antimicrobials. and Prevotella intermedia), Porphyromonas asaccha- 


rolytica, nonpigmented Prevotella (Prevotella oralis, 
Prevotella oris), and Fusobacterium nucleatum are 
part of the normal oral flora and are the predominant 
gram-negative anaerobic species isolated from URTIs 


This review describes the role of anaerobic bacte- 
ria in URTIs and other head and neck infections and 
describes the management of these infections. 


ANAEROBES RECOVERED IN UPPER and their complications and other head and neck in- 
RESPIRATORY TRACT INFECTIONS AND OTHER fections. The growing resistance of gram-negative 
HEAD AND NECK INFECTIONS anaerobic bacilli to penicillins has been noticed in 

The clinically important anaerobes (Table 1) are 6 the past decade. These include pigmented Prevotel- 
genera of gram-negative rods (Bacteroides, Prevo- la spp and Porphyromonas spp, Fusobacterium spp, 
tella, Porphyromonas, Fusobacterium, Bilophila, and and-P oralis. The main mechanism of resistance is 
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TABLE 1. ANAEROBIC BACTERIA MOST FREQUENTLY ENCOUNTERED IN URTIs AND 
HEAD AND NECK INFECTIONS 








Organism 


Infectious Sites 





Gram-positive cocci 
Peptostreptococcus spp 
Microaerophilic streptococci* 

Gram-positive bacilli ` 
Non-spore-forming 

Actinomyces spp 

Propionibacterium acnes 

Bifidobacterium spp 
Spore-forming 

Clostridium perfringens 

Clostridium difficile 

Clostridium ramosum 

Gram-negative bacilli 
Bacteroides fragilis group 
Pigmented Prevotella and Porphyromonas 
Prevotella oralis 
Prevotella oris-buccae 
Fusobacterium nucleatum 
Fusobacterium necrophorum 

URTIs — upper respiratory tract infections. 
*Not obligate anaerobes. 


Bacteremia 


Respiratory tract, deep neck, and soft tissue infections 
Sinusitis, brain abscesses 


Intracranial abscesses, chronic mastoiditis, head and neck infections 
Infections associated with foreign bodies 
Chronic otitis media, cervical lymphadenitis 


Soft tissue infections 
Colitis, antibiotic-associated diarrheal disease 
Soft tissue infections 


Chronic otitis and sinusitis (rare) 

Orofacial and deep neck infections, periodontitis 

Orofacial infections 

Orofacial infections 

Orofacial, deep neck, and respiratory tract infections, brain abscesses, bacteremia 





through the production of the enzyme f-lactamase. 


Anaerobic streptococci (Peptostreptococcus spp) 
predominate in all types of URTIs and their compli- 
cations.? They generally are recovered mixed with 
other aerobic or anaerobic organisms, but in many 
cases, including those with bacteremia, they are the 
only pathogens recovered. Microaerophilic strepto- 
cocci are of particular importance in chronic sinusi- 
tis and brain abscesses. 


PATHOGENICITY AND VIRULENCE 


Anaerobes as Normal Flora. The human body mu- 
cosal and epithelial surfaces are colonized with aero- 
bic and anaerobic microorganisms.!2 The organisms 
at the different sites tend to belong to certain major 
bacterial species, and their presence in that location 


is predictable. The relative and total counts of orga- 


nisms can be affected by various factors, such as age, 
diet, anatomic variations, illness, hospitalization, and 
antimicrobial therapy. However, the predictable pat- 
terns of bacterial flora-temain stable through life, 


despite their subjection to perturbing factors. Anaer-. 


obes outnumber aerobic bacteria in all mucosal sur- 
faces, including the oropharynx, and certain orga- 
nisms predominate in the different sites. 


The microflora of the upper airways, including the 
oral cavity, nasopharynx, and oropharynx, is com- 
plex and contains many kinds of obligate anaerobes. 
The ratio of anaerobic bacteria to aerobic bacteria in 


saliva is approximately 10:1. The total count of an- 
aerobic bacteria in the saliva and elsewhere in the 
oral cavity reaches 107/mL to 108/mL. 


Knowledge of the composition of the oral flora is 
useful for predicting which organisms may be in- 
volved in an infection adjacent to that site and can 
assist in the selection of an empiric antimicrobial 
therapy, even before the exact microbial cause of the 
infection is known. 


Anaerobes belonging to the indigenous flora of 
the oral cavity can be recovered from various infec- 
tions adjacent to that area, such as cervical lymphad- 
enitis; subcutaneous abscesses and burns in proxim- 
ity to the oral cavity; human and animal bites; paro- 
nychia; tonsillar and retropharyngeal abscesses; 
chronic sinusitis; chronic otitis media; mastoiditis; 
periodontal abscess; thyroiditis; aspiration pneumo- 
nia; lung abscess; and bacteremia associated with one 
of the above infections (Fig 1).4° The predominant 
anaerobes recovered in these infections are Prevo- 
tella, Porphyromonas, P oralis, Fusobacterium, and 
Peptostreptococcus spp, which are all part of the nor- 
mal flora of the mucous surfaces of the oropharynx 
(Table 1). 


Conditions Predisposing to Anaerobic Infection. 
The clinical situations that predispose to anaerobic 
infections include exposure of the sterile body sites 
to a large inoculum of indigenous mucous membrane 
flora. Poor blood supply and tissue necrosis lower 
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Fig 1. Oropharyngeal flora as source of infection. 


the oxidation-reduction potential and favor the growth 
of anaerobes. Any condition that lowers the blood 
supply to an affected area can predispose to anaero- 
bic infection. Therefore, trauma, foreign body, malig- 
nancy, surgery, edema, shock, and vascular disease 
may predispose to anaerobic infection. Previous in- 
fection with aerobic or facultative organisms also 
may make the local tissue conditions more favor- 
able for the growth of anaerobes. The immune de- 
fense mechanisms also may be impaired by anaero- 
bic conditions and anaerobic bacteria. 


Suppuration, abscess formation, thrombophlebi- 
tis, and gangrenous destruction of tissue associated 
with gas formation are the hallmarks of anaerobic 
infection. Anaerobes are especially common in chron- 
ic infections, and they are commonly seen after un- 
successful therapy with antimicrobials that fail to 
eradicate them (ie, aminoglycosides, trimethoprim- 
sulfamethoxazole, and the older quinolones). 


Certain infections are very likely to involve anaer- 
obes as important pathogens, and the presence of an- 
aerobes should always be assumed. Such infections 
include chronic URTIs and head and neck infections. 


Virulence Factors. Anaerobes contribute to the se- 
verity of infection through their synergy with their 
aerobic counterparts and with each other.!° Anaero- 
bic bacteria require more time than do aerobic bac- 
teria to become virulent, because some of the major 
virulence factors of certain anaerobes (ie, the produc- 
tion of a capsule by anaerobic gram-negative bacilli) 
are expressed only after the infection has become 
chronic. !! 


Anaerobes possess several important virulence 
factors, including the presence of surface structures 
such as a capsule of polysaccharide or lipopolysac- 
charide, and the production of superoxide dismutase 
and catalase, immunoglobulin proteases, coagula- 
tion-promoting and -spreading factors (such as hy- 
aluronidase, collagenase, and fibrinolysin), and ad- 
herence factors. !? Other factors that enhance the viru- 
lence of anaerobes include mucosal damage, an oxi- 
dation-reduction potential drop, and the presence of 
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Fig 2. Protection of penicillin-susceptible bacteria from 
penicillin by B-lactamase—producing bacteria. 


hemoglobin or blood in an infected site. 


An indirect pathogenic role of some anaerobes is 
their ability to produce the enzyme B-lactamase. B- 
lactamase—producing bacteria (BLPB) can be in- 
volved directly in the infection and protect not only 
themselves, but also other penicillin-susceptible or- 
ganisms, from the activity of penicillins (Fig 2). This 
protection can occur when the enzyme is secreted 
into the infected tissue or abscess fluid in sufficient 
quantities to degrade the B-lactam ring of penicillin 
or cephalosporin before the drug can kill the suscep- 
tible bacteria. !3 


MICROBIOLOGY DIAGNOSIS 


The proper management of anaerobic infection 
depends on appropriate documentation of the bacte- 
ria causing the infection. Without such an approach, 
the patient may be exposed to inappropriate, costly, 
and undesirable antimicrobial agents with adverse 
side effects. Certain or all of the anaerobes may not 
be recovered when the specimen is not promptly 
placed under anaerobic conditions for transport to 
the laboratory. 


The essential elements requiring the cooperation 
of the physician and the microbiology laboratory for 
appropriate documentation of anaerobic infection are 
the collection of appropriate specimens, expeditious 
transportation, and careful laboratory processing. 


Appropriate cultures for anaerobic bacteria are es- 
pecially important in mixed aerobic and anaerobic 
infections.> Techniques or media that are inadequate 
for isolation of anaerobic bacteria can lead to the as- 
sumption that the aerobic organism(s) recovered are 
the sole pathogens. This assumption may cause the 
clinician to direct therapy toward only those aerobic 
organisms. 


Specimens should be obtained free of contamina- 
tion, so that normal flora organisms are excluded. 
Because indigenous anaerobes often- are present on 
the surfaces of skin and mucous membranes in large 
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TABLE 2. METHODS FOR COLLECTION OF SPECIMENS FOR ANAEROBIC BACTERIA IN URTIs AND 
HEAD AND NECK INFECTIONS 








Infection Site 


Methods 





Abscess or body cavity 


Aspiration by syringe and needle 


For incised abscesses, syringe or swab (less desirable) 
Specimen obtained during surgery after cleansing of skin 
Surgical specimen by tissue biopsy or curette 


Sinuses or mucous surface abscesses 


Aspiration after decontamination or surgical specimen 


Ear Aspiration after decontamination of ear canal and membrane; in perforation, cleansing 
of ear canal and aspiration through perforation 





numbers, even minimal contamination of a specimen 
with normal flora can give misleading results. Speci- 
mens should therefore be designated according to their 
acceptability for anaerobic culture to the acceptable 
or the unacceptable category. Appropriate specimens 
for anaerobic cultures should be obtained by a tech- 
nique that bypasses the normal flora (Table 2). Un- 
acceptable or inappropriate specimens can yield nor- 
mal flora also and therefore have no diagnostic val- 
ue. 


Respiratory tract specimens are difficult to obtain 
without contamination with indigenous flora. Accept- 
able head and neck specimens include blood speci- 
mens; aspirates of abscess, middle ear, or sinus con- 
tents; and deep wound aspirates. Specimens may be 
collected after thorough decontamination of the skin, 
mucous membrane, or external ear canal. 


Specimens should be transported to a microbiol- 
ogy laboratory promptly in a transport device that is 
oxygen-free. Specimens should be placed into an an- 
aerobic transporter as soon as possible after their col- 
lection. Aspirates of liquid specimen or tissue are al- 
ways preferred to swabs. Liquid specimens may be 
inoculated into an anaerobic transport vial. A plastic 
or glass syringe and needle also may be used for 
transport. After collection, all air should be expelled 
from the syringe. No more than 30 minutes should 
elapse before the specimen is plated, because air grad- 
ually diffuses into the plastic syringe. 


UPPER RESPIRATORY TRACT AND OTHER HEAD 
AND NECK INFECTIONS AND THEIR 
COMPLICATIONS 


Anaerobic bacteria can be isolated from a variety 
of URTIs and other head and neck infections and 
predominate in their chronic form. These URTIs in- 
clude chronic otitis media, sinusitis, and mastoidi- 
tis; tonsillar, peritonsillar, and retropharyngeal ab- 
scesses; all deep neck space infections; thyroiditis; 
odontogenic infections; and postsurgical and nonsur- 
gical head and neck wounds and abscesses (Table 
3). 


Dental Infections. Most dental infections involve 
anaerobes. These include pulpitis and endodontal and 


periodontal (gingivitis and periodontitis) infections, 
periapical and dental abscesses, and perimandibular 
space infection.!4.!5 Pulpitis may progress to an ab- 
scess and eventually involve the mandible and other 
neck spaces. In addition to the above-mentioned or- 
ganisms, microaerophilic streptococci and Strepto- 
coccus Salivarius can also be involved. 


Vincent’s angina (or trench mouth) is a distinct 
form of ulcerative gingivitis. The causative organisms 
include Fusobacterium spp and anaerobic spiro- 
chetes; however, definitive studies by anaerobic mi- 
crobiological methods remain to be performed. 


Otitis Media. Anaerobes have been isolated in 5% 
to 15% of patients with acute otitis media!® and in 
42% of culture-positive aspirates of patients with se- 
rous Otitis media.!7 The predominant isolates in acute 
and serous otitis were Peptostreptococcus spp and 
Propionibacterium acnes. Gram-negative anaerobic 
bacilli were also found in serous otitis media. Per- 
sistent otitis media that fails to respond to antimicro- 
bial therapy can lead to chronic infection that is as- 
sociated with emergence of resistant anaerobic and 
aerobic bacteria. This pattern was recently demon- 
strated in a series of 7 children who had repeated 
aspirations of middle ear fluid through their perfo- 
rated eardrums over a period of 36 to 55 days.!8 


Anaerobes have been recovered in about 50% of 
patients with chronic suppurative otitis media‘6.19,20 
and of those with cholesteatoma.”!.22 The variability 
in the rate of isolation of anaerobes in these studies 
may be attributed to differences in geographic loca- 
tions and laboratory methods. The predominant an- 
aerobes recovered were gram-negative bacilli and 
Peptostreptococcus spp, and the aerobes were Pseu- 
domonas aeruginosa and Staphylococcus aureus. 
Anaerobic bacteria were generally recovered mixed 
with aerobic bacteria, and the number of isolates per 
specimen ranged between 2 and 6. Many of these or- 
ganisms can produce f-lactamase, and they may have 
contributed to the high failure rate of B-lactam anti- 
biotics in the therapy of this infection. Greater suc- 
cess in eradicating these infections can be achieved 
with antimicrobial therapy directed at the eradica- 
tion of anaerobic bacteria, as compared to therapies 
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TABLE 3. AEROBIC AND ANAEROBIC BACTERIA ISOLATED IN URTIs AND HEAD AND NECK INFECTIONS 








Type of Infection 


Aerobic and Facultative Organisms 


Anaerobic Organisms 





Acute otitis media 


Chronic otitis media and mastoiditis 


Peritonsillar and retropharyngeal abscesses 


Recurrent tonsillitis 


Suppurative thyroiditis 


Acute sinusitis 


Chronic sinusitis 


Streptococcus pneumoniae 


Haemophilus influenzae* 
Moraxella catarrhalis* 
Staphylococcus aureus* 
Escherichia coli* 
Klebsiella pneumoniae* 


Pseudomonas aeruginosa* 


Peptostreptococcus spp 
Streptococcus pyogenes 
Staphylococcus aureus* 
S pneumoniae 
Streptococcus pyogenes 
H influenzae* 

S aureus* 

S pyogenes 

S aureus* 

H influenzae* 

S pneumoniae 

M catarrhalis* 

S aureus* 

S pneumoniae 

H influenzae* 


Peptostreptococcus spp 


Pigmented Prevotella and Porphyromonas spp 
Bacteroides spp* 
Fusobacterium spp* 


Fusobacterium spp* 
Pigmented Prevotella and Porphyromonas spp* 


Fusobacterium spp* 


Pigmented Prevotella and Porphyromonas spp* 


Peptostreptococcus spp __ - 


Pigmented Prevotella and Porphyromonas spp* 
Fusobacterium spp* 


Cervical lymphadenitis S aureus* 


Mycobacterium spp 
Postoperative infection disrupting oral mucosa Staphylococcus spp* 


Pigmented Prevotella and Porphyromonas spp* 
Peptostreptococcus spp 
Fusobacterium spp* 


Enterobacteriaceae* Bacteroides spp* 
Staphylococcus spp* Pigmented Prevotella and Porphyromonas spp* 
Peptostreptococcus spp 
Deep neck infection Streptococcus spp Bacteroides fragilis group 
Staphylococcus spp* Pigmented Prevotella and Porphyromonas spp* 


Odontogenic complications 


Oropharyngeal Vincent’s angina 
Necrotizing ulcerative gingivitis 
*Organisms that have potential of producing B-lactamase. 


Streptococcus spp 
Staphylococcus spp* 


Streptococcus spp 
Staphylococcus spp* 


Peptostreptococcus spp* 

Pigmented Prevotella and Porphyromonas spp* 
Peptostreptococcus spp 

Fusobacterium spp* 

Fusobacterium necrophorum* 

Spirochetes 





less effective against these organisms.?3 


Anaerobic bacteria are often isolated from infected 
cholesteatomas.”!,22 Cholesteatoma, which often ac- 
companies chronic suppurative otitis media, can en- 
hance the absorption of bone, which is accelerated 
by organic acids produced by anaerobic bacteria.?? 
Because cholesteatoma associated with chronic sup- 
purative otitis media contains bacteria similar to those 
isolated from chronically infected ears, the cholestea- 
toma may serve as a nidus of the chronic infection. 
Anaerobes were isolated from 23 of 24 (96%) speci- 
mens of chronic mastoiditis,2* and from most patients 
with intracranial abscesses that complicated chronic 
suppurative otitis media.+6 


Sinusitis. The most common pathogens in the acute 
stage of sinusitis are similar to those recovered in 


acute otitis media: Streptococcus pneumoniae, Hae- 
mophilus influenzae, and Moraxella catarrhalis. An- 
aerobic organisms become involved as the infection 
turns chronic and the levels of tissue oxygen decline.?5 
Although anaerobes are generally isolated from only 
about 10% of patients with acute sinusitis (generally 
in maxillary sinusitis secondary to periodontal infec- 
tion), they can be isolated from up to 67% of patients 
with chronic irifection.26 An average of 3 anaerobes 
per sinus aspirate were recovered in patients with 
chronic sinusitis.26 Persistent sinusitis that fails to 
respond to antimicrobial therapy can lead to chronic 
infection that is associated with emergence of resis- 
tant anaerobic and aerobic bacteria. The gradual 
emergence of such flora was demonstrated in a se- 
ries of 5 patients who had repeated endoscopic aspi- 
rations of maxillary sinus fluid over a period of 34 
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to 50 days.2’ 


One study”® demonstrated elevated levels of anti- 
body to F nucleatum and P intermedia in patients 
with chronic maxillary sinusitis, suggesting a poten- 
tial pathogenic role for these organisms. The median 
antibody levels for F nucleatum and P intermedia 
were significantly lower in patients who were cured 
at days 75 to 90, as compared to the levels on the 
day of diagnosis (p < .05). However, the levels stayed 
elevated in patients whose infection did not improve. 
Antimicrobial therapy directed at the eradication of 
gram-negative anaerobic bacteria was superior to 
therapies less effective against these organisms.?9 


The infection may spread via anastomosing veins 
or spread contiguously to the central nervous sys- 
tem. Complications include orbital cellulitis, menin- 
gitis, cavernous sinus thrombosis, and epidural, sub- 
dural, and brain abscesses.46.30 


Tonsillitis. Indirect evidence supports the involve- 
ment of anaerobes in acute and chronic tonsillitis. 
The evidence is derived from studies that show the 
major role of anaerobes in complications of tonsilli- 
tis. The organisms associated with the infection are 
Fusobacterium spp, gram-negative anaerobic bacilli, 
and Peptostreptococcus spp. Polymicrobial aerobic 
and anaerobic flora predominate in peritonsillar and 
retropharyngeal abscesses.4631 These organisms can 
be isolated from 25% of suppurative cervical lymph 
nodes and are mostly associated with the presence 
of dental or tonsillar infections.32 Anaerobes have 
been associated with thrombophlebitis of the inter- 
nal jugular veins, which often causes postanginal sep- 
sis.46 


The pathogenic role of anaerobes in the acute in- 
flammatory process in the tonsils is also supported 
by several clinical observations: their recovery in ton- 
sillar, peritonsillar, or retropharyngeal abscesses in 
many cases without any aerobic bacteria,?! the iso- 
lation of anaerobes from tonsils in Vincent’s angi- 
na,*6 the recovery of encapsulated pigmented Prevo- 
tella and Porphyromonas spp in acutely inflamed ton- 
sils, the isolation of anaerobes from the core of recur- 
rently inflamed tonsils with non—group A B-hemoly- 
tic streptococci (GABHS),?3 and the response to an- 
timicrobials in patients with non-GABHS tonsilli- 
tis.34.35 Furthermore, an immune response against P 
intermedia can be detected in patients with non- 
GABHS tonsillitis3®; and an immune response can 
also be detected against P intermedia and F nucle- 
atum in patients who have recovered from peritonsil- 
lar cellulitis or abscesses?’ and infectious mononu- 
cleosis.°8 


Therapy with metronidazole alleviated the symp- 
toms of tonsillar hypertrophy and shortened the dura- 


tion of fever in patients with infectious mononucleo- 
sis.34 Because metronidazole has no antiviral or aero- 
bic antibacterial efficacy, suppression of the oral an- 
aerobic flora may contribute to diminishing the in- 
flammation induced by the Epstein-Barr virus. This 
explanation is supported by the increased recovery 
of P intermedia and F nucleatum during the acute 
phases of infectious mononucleosis.3? 


Anaerobes have been isolated from the cores of 
tonsils of children with recurrent GABHS2640 and 
non-GABHS tonsillitis.33 B-lactamase—producing 
aerobic and anaerobic bacteria were recovered from 
75% of tonsils of children with recurrent GABHS ton- 
sillitis!3.4041 and from 40% of those with non-GABHS 
tonsillitis.>3 Similar organisms were recovered in the 
adenoids of patients with adenoiditis, adenoid hyper- 
trophy, and recurrent otitis media.4? 


Recurrent pharyngotonsillitis and penicillin fail- 
ure to eradicate GABHS can be a serious clinical prob- 
lem. One explanation for penicillin failure is that re- 
peated administrations result in selection of BLPB.!3 
The recovery of these bacteria in more than three 
quarters of patients with recurrent GABHS tonsil- 
litis,!3:40,41 the ability to measure B-lactamase activ- 
ity in the core of these tonsils,43 and the response of 
patients to antimicrobials effective against BLPB 
(ie, clindamycin or amoxicillin plus clavulanic 
acid)!3,26.44.45 support the role of these aerobic and 
anaerobic BLPB in the failure of penicillin to eradi- 
cate GABHS tonsillitis. 


Parotitis. Acute suppurative parotitis is generally 
caused by S$ aureus or Streptococcus spp, or, rarely, 
aerobic gram-negative bacteria. Anaerobes, mostly 
Peptostreptococcus, Bacteroides, and pigmented 
Prevotella and Porphyromonas spp, have also been 
recognized as an important cause of this infection.“ 


Cervical Lymphadenitis. The most common causes 
in children are viruses. The organisms that cause 
acute unilateral infection associated with facial trau- 
ma or impetigo are S aureus and GABHS. Cat-scratch 
and mycobacterial inféctions are important in chronic 
infections. Anaerobic bacteria have been isolated in 
about 25% of cases of cervical lymphadenitis, often 
in pure culture?2; the predominant anaerobes were 
Fusobacterium and Peptostreptococcus spp. The re- 
covery of anaerobes was often associated with a pri- 
mary dental, periodontal, or tonsillar infection. 


Thyroiditis. Anaerobic bacteria such as anaerobic 
gram-negative bacilli and Peptostreptococcus spp 
have been identified as causative agents in thyroidi- 
tis.4647 Fikenella corrodens and: Actinomyces spp 
have also been reportedz... ṣo 


Deep Neck Infections: Deep neck infections gen- 
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erally follow oral, dental, and pharyngeal infections ` 


and are generally polymicrobial, involving the anaer- 
obes that caused the primary infections. They involve 
infections in the deep posterior neck spaces (retropha- 
ryngeal, prevertebral, and visceral vascular), suprahy- 
oid spaces (pharyngomaxillary, submandibular, man- 
dibular, masticator, temporal, parotid, and peritonsil- 
lar), and infrahyoid space. Mediastinitis following 
perforation of the esophagus or extension of retropha- 
ryngeal abscess or cellulitis, or abscess of dental ori- 
gin, is most likely to involve mixed aerobic-anaerobic 
infection.48:49 


Infected cysts (thyroglossal duct cysts, cystic hy- 
gromas, branchial cleft cysts, laryngoceles, and der- 
moid cysts) can become inflamed and cause local 
infection. The organisms that can cause secondary 
infection of these cysts can originate from either the 
skin or the oropharynx and therefore include oral an- 
aerobes.50 


Wound Infection After Head and Neck Surgery. 
These infections are related to the exposure of the 
surgical site to the oropharyngeal flora and the com- 
promise of the surgical site. Postsurgical head and 
neck wounds, especially those following head and 
neck surgery, are generally infected by polymicro- 
bial aerobic and anaerobic flora; the number of iso- 
lates varies from | to 9 (average, 6).>' The most fre- 
quently recovered isolates are Peptostreptococcus 
spp, S aureus, anaerobic gram-negative bacilli, Fuso- 
bacterium spp, and enteric gram-negative rods. The 
presence of polymicrobial flora in postsurgical 
wounds warrants the use of antimicrobials effective 
against these organisms in the prophylaxis and ther- 
apy of these infections.°? 


Intracranial Complications of URTIs. Upper res- 
piratory tract infections caused by anaerobic bacte- 
ria can cause a variety of intracranial complications. 
These include brain abscess, subdural empyema, epi- 
dural abscess, and meningitis. The main source of 
brain abscess is an adjacent, generally chronic infec- 
tion in the ears, mastoids, sinuses, oropharynx, teeth, 
or lungs (Fig 1).5 Ear or mastoid infection tends to 
spread to the temporal lobe or cerebellum, whereas 
sinusitis often causes abscess of the frontal lobe. The 
route of extension is contiguous, hematogenous (gen- 
erally through retrograde thrombophlebitis), or 
through the lymphatics. 


The anaerobic bacteria generally recovered from 
brain abscesses that complicate respiratory and den- 
tal infections include Prevotella, Porphyromonas, 
Bacteroides, Fusobacterium, and Peptostreptococ- 
cus spp. Microaerophilic and other streptococci are 
also often isolated. Actinomyces is less frequently 


encountered. 


MANAGEMENT 


The recovery from an anaerobic infection depends 
on prompt and proper management. The principles 
guiding the management of anaerobic infections in- 
clude neutralizing the toxins produced by the anaer- 
obes, preventing the local proliferation of the anaero- 
bic bacteria by changing the environment, and ham- 
pering the spread of the bacteria into healthy tissues. 


Environmental control is achieved by debriding 
necrotic tissue, draining pus, improving circulation, 
alleviating obstructions, and increasing the tissue oxy- 
genation. Certain types of adjunct therapy such as 
hyperbaric oxygen may also be useful. Antimicrobi- 
als’ primary role is in limiting the local and systemic 
spread of the organism. Antimicrobial therapy is in 
many patients the only form of therapy required, 
whereas in others, it is supplemented by a surgical 
approach. 


Surgical Therapy. In many cases, surgical therapy 
is the most important and sometimes the only form 
of treatment required, whereas in others, surgical 
therapy is an important adjunct to a medical approach. 
Surgery is important in draining abscesses, debrid- 
ing necrotic tissues, decompressing closed-space in- 
fections such as sinuses, relieving obstructions, and 
correcting the underlying disorder. When surgical 
drainage is not performed, the infection may persist, 
and serious complications may develop. 


Antimicrobial Therapy. Appropriate management 
of mixed aerobic and anaerobic infections requires 
the administration of agents effective against both 
types of organisms. A number of factors should be 
considered in choosing appropriate antimicrobial 
agents. They should be effective against all target 
organism(s), induce little or no resistance, achieve 
sufficient levels in the infected site, have a safety 
record and appropriate dosage schedules, and have 
minimal toxicity and maximum stability. 


Antimicrobials may fail to cure the infection. 
Among the reasons are the development of bacterial 
resistance, achievement of insufficient tissue levels, 
incompatible drug interaction, and the development 
of an abscess. The environment of an abscess is det- 
rimental to many antibiotics. The abscess capsule in- 
terferes with the penetration of drugs, and the low 
pH and the presence of binding proteins or inactivat- 
ing enzymes (ie, B-lactamase) may impair their ac- 
tivity. The low pH and the anaerobic environment 
within the abscess are especially unfavorable for the 
aminoglycosides and quinolones. An acidic pH, high 
osmolarity, and an anaerobic environment can also 
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TABLE 4. SUSCEPTIBILITY TO ANTIMICROBIAL AGENTS OF ANAEROBIC BACTERIA PRESENT IN URTIs AND 








HEAD AND NECK INFECTIONS 
Penicillin Ureido- 
and B- . and 


Lactamase Carboxy- 


Chloram-  Clinda- Metroni- Carba- 





Bacteria Penicillin Inhibitor Penicillin Cefoxitin phenicol mycin Macrolides dazole penems 
Peptostreptococcus spp 4 4 3 3 3 3 2-3 2 3 
Fusobacterium spp 1-3 3-4 3 3 3 2-3 1 3 3 
Bacteroides fragilis group 1 4 2-3 3 3 3-4 1-2 4 4 
Prevotella spp and 

Porphyromonas spp 1-3 4 2-3 3 3 3-4 2-3 4 4 
Clostridium perfringens 4 4 3 3 3 3 3 3 3 
Clostridium spp 3 3 3 2-3 3 2 2 3 3 
Actinomyces spp 4 4 3 3 3 3 3 1 3 


Degrees of activity: 1 — minimal; 2 — moderate; 3 — good; 4 — excellent. 





develop in an infection site in the absence of an ab- 
scess. 


In choosing antimicrobials to treat mixed infec- 
tions, their aerobic and anaerobic antibacterial spec- 
trum (Table 4) and their availability in oral or paren- 
teral form should be considered. Some antimicrobi- 
als have a limited range of activity. For example, met- 
ronidazole is active only against anaerobes and there- 
fore cannot be administered as a single agent for the 
therapy of mixed infections. Others (ie, imipenem) 
have wide spectra of activity against Enterobacteri- 
aceae and anaerobes. 


Antimicrobial selection is simplified when reli- 
able culture results are available. However, reliable 
culture may be difficult to achieve because of the 
problems in obtaining appropriate specimens in an- 
aerobic infections. For this reason, many patients are 
treated empirically on the basis of suspected, rather 
than known, pathogens. Fortunately, the types of an- 
aerobes involved in many anaerobic infections and 
their antimicrobial susceptibility patterns tend to be 
predictable, although they may vary in a particular 
hospital. Some anaerobic bacteria, however, have be- 
come resistant to antimicrobial agents or may be- 
come so while a patient is receiving therapy. 


Factors other than susceptibility patterns also in- 
fluence the choice of antimicrobial therapy. These 
include the pharmacological characteristics of the 
various drugs, their toxicity, their effect on the nor- 
mal flora, and bactericidal activity. Although identi- 
fication of the infecting organisms and their antimi- 
crobial susceptibility may be needed for selection of 
optimal therapy, the clinical setting and Gram’s stain 
preparation of the specimen may suggest what types 
of anaerobes are present in the infection, as well as 
the nature of the infectious process. 


Because anaerobic bacteria generally are recov- 
ered mixed with aerobic organisms, selection of prop- 


er therapy becomes more complicated. In the treat- 
ment of mixed infection, the choice of the appropri- 
ate antimicrobial agents should provide for adequate 
coverage of most of the pathogens. 


Antimicrobial Agents. Some classes of agents pos- 
sess poor activity against anaerobes. These include 
the aminoglycosides, the monobactams, trimetho- 
prim-sulfamethoxazole, and the older quinolones. 
Antimicrobials suitable to control anaerobic infec- 
tions are summarized in Table 4 and discussed in 
more detail below.5455 


Penicillin G is the drug of choice against most 
non--lactamase-producing bacteria. These include 
anaerobic streptococci, Clostridium spp, nonsporu- 
lating anaerobic bacilli, and most non-B-lactamase— 
producing gram-negative anaerobic rods. However, 
in addition to the B fragilis group, which is known 
to resist the drug through production of B-lactamase, 
many other anaerobic gram-negative rods are show- 
ing increased resistance. These include Fusobacte- 
rium, pigmented Prevotella, and Porphyromonas, 
which are common in orofacial and respiratory in- 
fections. 


The combination of -lactamase inhibitors (eg, 
clavulanic acid) plus a B-lactam antibiotic (eg, amox- 
icillin, ticarcillin) can overcome this mechanism of 
resistance. Other forms of resistance are alterations 
in the porin canal through which the antimicrobial 
penetrates into the bacteria and changes in the peni- 
cillin-binding proteins. 


The semisynthetic penicillins carbenicillin, ticar- 
cillin, piperacillin, and mezlocillin are generally ad- 
ministered in large quantities to achieve high serum 
concentrations. These drugs have good activity 
against gram-negative enterics and most anaerobes 
in these concentrations. However, these drugs are not 
entirely resistant to the B-lactamase produced by 
gram-negative anaerobic bacilli. 
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The activity of parenteral cephalosporins against 
gram-negative anaerobic bacilli varies. Most strains 
of the B fragilis group and many Prevotella, Porphy- 
romonas, and Fusobacterium strains are resistant to 
first-generation cephalosporins because of cephalo- 
sporinase production. Cefoxitin is relatively resistant 
to this enzyme and is the most effective cephalosporin 
against the B fragilis group, and is often used for the 
therapy and prophylaxis of mixed infections. Cefo- 
tetan and cefmetazole have a longer half-life and are 
as effective as cefoxitin against B fragilis, but have 
poor efficacy against other members of the B fragilis 
group (ie, Bacteroides thetaiotaomicron). Third-gen- 
eration cephalosporins are inferior to cefoxitin against 
Bacteroides spp. All oral cephalosporins have poor 
activity against §-lactamase—producing anaerobes. 


Carbapenems (imipenem, meropenem) have excel- 
lent activity against a broad spectrum of aerobic bac- 
teria and anaerobic bacteria, including B-lactamase— 
producing anaerobes, Enterobacteriaceae, and Pseu- 
domonas. 


Chloramphenicol has excellent in vitro activity 
against most anaerobes, and resistance is rare. The 
drug also is effective against many Enterobacteria- 
ceae and gram-positive cocci. The rare but fatal aplas- 
tic anemia, the dose-dependent leukopenia, gray syn- 
drome in newborns and patients with impaired he- 
patic glucuronidation, limit its use. 


Clindamycin and lincomycin are effective against 
anaerobes and have good activity against aerobic 
gram-positive cocci. Clindamycin has the broader 
coverage against anaerobes, including B-lactamase— 
producing ones. Antibiotic-associated colitis due to 
Clostridium difficile was first described as follow- 
ing clindamycin therapy. However, colitis has been 
associated with other antimicrobials. 


Metronidazole has excellent activity against anaer- 
obes; however, it is not effective against aerobic bac- 
teria. Microaerophilic streptococci, P acnes, and Acti- 
nomyces spp often are also resistant. Concern was 
raised about the carcinogenic and mutagenic effects 
of this drug; however, these effects were shown only 
in 1 species of mice and were never substantiated in 
other animals or humans.*6 


Macrolides (erythromycin, azithromycin, clarithro- 
mycin) have moderate to good in vitro activity against 
anaerobic bacteria other than B fragilis and Fuso- 
bacterium spp. They are active against Prevotella and 
Porphyromonas spp, microaerophilic and anaerobic 
streptococci, gram-positive non—spore-forming an- 
aerobic bacilli, and certain Clostridium spp. They 
show relatively good activity against Clostridium per- 
fringens and show poor or inconsistent activity against 


gram-negative anaerobic bacilli. 


Glycopeptides (vancomycin, teicoplanin) are ef- 
fective against all gram-positive anaerobes (includ- 
ing C difficile), but are inactive against anaerobic 
gram-negative bacilli. 


The newer tetracycline analogs, doxycycline and 
minocycline, are more active than the parent com- 
pound. The use of tetracyclines is not recommended 
in patients younger than 8 years of age because of 
their adverse effect on teeth and bone. 


The older quinolones (ciprofloxacin, ofloxacin) are 
less active than the newer ones (trovafloxacin, moxi- 
floxacin, gatifloxacin) against anaerobic gram-nega- 
tive bacteria. However, the use of the quinolones is 
limited in growing children because of their possible 
adverse effects on cartilage. 


Choice of Antimicrobial Agents. The parenteral 
antimicrobials that can be used in most URTIs and 
head and neck infections are clindamycin, metroni- 
dazole, chloramphenicol, cefoxitin, a penicillin (eg, 
ticarcillin) plus a B-lactamase inhibitor (eg, clavu- 
lanic acid), and a carbapenem (ie, imipenem). An 
anti-gram-negative enteric agent (ie, aminoglyco- 
side) is generally added to clindamycin, metronida- 
zole, or, occasionally, cefoxitin in treating mixed in- 
fections that include gram-negative aerobic bacte- 
ria. Penicillin is added to metronidazole in the therapy 
of intracranial, pulmonary, and dental infections to 
cover microaerophilic streptococci, Actinomyces spp, 
and Arachnia spp. A macrolide (ie, erythromycin) can 
be added to metronidazole in URTIs to treat $ aureus 
and aerobic streptococci. Penicillin is added to clin- 
damycin to supplement its coverage against Pepto- 
streptococcus spp and other gram-positive anaerobic 
organisms. 


Penicillin is still the drug of choice for bacteremia 
caused by non—f-lactamase—producing bacteria. 
However, other agents should be used for the therapy 
of bacteremia caused by BLPB. 


The duration of therapy for strictly anaerobic in- 
fections, which are often chronic, is generally longer 
than that for infections due to aerobic and faculta- 
tive anaerobes. Oral therapy is often substituted for 
parenteral therapy after an initial period. The agents 
available for oral therapy are limited and include 
amoxicillin plus clavulanic acid, clindamycin, chlor- 
amphenicol, and metronidazole. 


Clinical judgment, personal experience, safety, and 
patient compliance should direct the physician in the 
choice of the appropriate antimicrobial agents. The 
duration of treatment also must be individualized, 
depending on the response. 
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EXTRAESOPHAGEAL REFLUX IN PATIENTS WITH CONTACT 
GRANULOMA: A PROSPECTIVE CONTROLLED STUDY 


RIITTA YLITALO, MD, PHD 


STIG RAMEL, MD, PHD 


STOCKHOLM, SWEDEN 


The aim of this study was to compare the incidences of extraesophageal reflux in patients with contact granuloma and healthy 
controls. A 24-hour ambulatory pH monitoring technique was used to measure reflux parameters in the pharynx and distal esophagus. 
Pharyngeal acid reflux events occurred in 17 of 26 granuloma patients (1 to 20 episodes per patient) and 5 of 19 controls (1 to 8 
episodes per subject). The reflux episodes were typically short and occurred predominantly in an upright position. A comparison 
between the groups showed a significant difference in the number of pharyngeal reflux episodes (p = .009) and in the total time of pH 
below 4 (p = .006). On the other hand, we found no significant differences in any esophageal reflux parameters, except for the 
percentage of distal esophageal reflux episodes that reached the pharynx (p = .006). In this study, pharyngeal acid exposure was 
significantly more prevalent in patients with contact granuloma than in healthy controls. 


KEY WORDS — 24-hour dual-probe pH monitoring, contact granuloma, extraesophageal reflux, healthy control. 


INTRODUCTION 


Contact granuloma is a relatively rare granuloma- 
tous inflammation of the posterior glottis that occurs 
mostly in men. The most common symptom is hoarse- 
ness combined with persistent throat clearing, sore 
throat, and globus sensation. Despite its infrequent 
occurrence, contact granuloma has received consid- 
erable attention in the literature because of frustra- 
tion encountered in its clinical management. The heal- 
ing time is long, regardless of treatment, and the re- 
currence rate after surgery is greater than 90%.! It is 
not known whether contact ulcer and contact granu- 
loma are separate entities or different stages of the 
same pathological process, but the terms are often 
used interchangeably. 


The association between laryngeal disease and gas- 
troesophageal reflux (GER) was first suggested in 
1968, by Cherry and Margulies,” who reported on 3 
patients with contact ulcer of the larynx and signifi- 
cant esophageal reflux on barium swallow studies. 
They suggested that laryngeal overspill of acid was 
a supplementary factor to vocal abuse in contact ul- 
cer formation. Delahunty and Cherry? supported this 
theory by the experimental induction of laryngeal 
granulomas. 


The first series of patients with contact granulo- 
mas evaluated for GER by modern diagnostic meth- 
ods, such as barium esophagography, esophageal ma- 
nometry, and pH monitoring, was reported in 1983 
by Ohman et al.4 In their study, 74% of patients had 
abnormal reflux. Furthermore, Svensson et al> found 





that 9 of their 13 patients with idiopathic granuloma 
had esophagitis, GER, or motility disturbances. Gold- 
berg et aló and Ward et al’ were the first to document 
the benefit of antireflux therapy in contact ulcer pa- 
tients. Nowadays, extraesophageal reflux (EER) is 
thought to be the cause of a variety of otolaryngologi- 
cal symptoms and disorders. Reflux into the phar- 
ynx has been previously documented in patients with 
chronic hoarseness,’ posterior laryngitis,!°!2 vocal 
fold nodules,!3 subglottic stenosis,!° and laryngeal 
cancer.!0 However, the frequency and pattern of EER 
in contact granuloma patients have not been sys- 
tematically studied, and only a few studies have in- 
cluded pH monitoring in the pharynx in healthy con- 
trols, 10.12.13 


Ambulatory 24-hour pH monitoring is currently 
the best single method for diagnosis of gastroesoph- 
ageal reflux disease (GERD).!4-!6 Combined esoph- 
ageal and pharyngeal pH monitoring offers the op- 
portunity to document not only GER, but also EER. 
Some studies have shown that many patients in oto- 
laryngological practice have a reflux pattern that is 
distinctly different from the pattern seen in patients 
with reflux esophagitis.!° Pharyngeal reflux is mainly 
occult in these patients and may therefore lead to a 
long delay in the diagnosis and treatment of the un- 
derlying disease. 


The aims of the present study were to determine 
and compare the incidences of GER and EER in pa- 
tients with contact granuloma and in normal control 
subjects. 
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MATERIAL AND METHODS 


Subjects. From February 1997 to February 1999, 
28 consecutive patients with contact granuloma were 
studied. Patients with contact ulcer or those with 
granuloma of known etiology, such as intubation, 
were excluded. Technically adequate registrations 
could be obtained in 26 of the 28 patients. These 26 
patients (24 men and 2 women), 37 to 74 years of 
age (mean, 52 years), all nonsmokers, were included 
in the study. The granuloma patients reported the 
presence of the following symptoms: sore throat 
(81%), hoarseness (54%), frequent clearing of the 
throat (50%), and heartburn and/or regurgitation 
(48%). 


During the study period, we also evaluated 20 vo- 
cally healthy volunteers recruited by advertisement. 
One of them was excluded from the later data analy- 
sis, because the patient was unable to tolerate ambu- 
latory pH monitoring for a full 24 hours. The remain- 
ing 19 healthy volunteers (16 men and 3 women) 
were 20 to 51 years of age (mean, 32 years). None of 
the healthy volunteers were taking drugs known to 
alter esophageal motor function or gastric acid se- 
cretion. They had no history of heartburn, acid regur- 
gitation, or epigastric pain. No upper gastrointestinal 
tract endoscopy or barium swallow studies were per- 
formed before the study. No controls had a history 
of alcohol abuse, diabetes mellitus, or vascular or 
neurologic disease. Only | of them smoked. At laryn- 
goscopic examination, all controls had normal laryn- 
geal mucosa or changes that were not considered 
pathological. 


The study was approved by the Ethical Committee 
for Human Research at Huddinge University Hospi- 
tal. 


Laryngoscopic Examination. The patients under- 
went a video laryngoscopic examination between 3 
and 6 weeks before the manometry and pH monitor- 
ing, whereas the healthy controls underwent all the 
examinations on the same day. The subjects were 
examined either with a rigid telescope (Storz 8706 
CJ, 70°) or with a flexible fiberscope (Olympus ENF- 
P3, 3.5 mm). The endoscopes were connected to a 
light source (Briiel & Kjær 4914 or Wolf 5052) and 
to video equipment (Panasonic GIE camera; Pan- 
asonic AG-7350 Super-VHS video recorder; Sony 
CVM-1810E monitor). The subjects were anes- 
thetized in the nose with naphazoline hydrochloride— 
lidocaine hydrochloride (0.17 mg/mL and 3.4 mg/ 
mL) as needed. To confirm the diagnosis, all of the 
recordings were later examined by another experi- 
enced phoniatrician. 


Manometry. The granuloma patients and the healthy 
controls initially underwent esophageal manometry. 


The locations of the upper esophageal sphincter (UES) 
and the lower esophageal sphincter (LES) were de- 
termined, and the resting pressure of the LES (milli- 
meters of mercury) and esophageal peristaltic activ- 
ity were evaluated. We used a low-compliance pneu- 
mohydraulic capillary infusion system (Arndorfer 
Medical Specialties Inc, Greendale, Wisconsin) and 
an 8-lumen polyvinyl catheter (Synectics Medical 
Inc, Stockholm, Sweden; diameter, 4.5 mm), with 
the 3 distal orifices placed at the same level and the 
others placed 5 cm apart. The catheter was perfused 
with distilled water at a rate of 0.5 mL/min, and pres- 
sures were registered with Gould P50-150 trans- 
ducers. Data were collected and analyzed with Poly- 
gram v 4.60 C2 (Synectics Medical Inc). The ma- 
nometry catheter was inserted transnasally into the 
stomach with the patient in a sitting position, and 
the manometry was performed with the patient in a 
supine position. A stationary pull-through technique 
was used to record the LES resting pressure in the 3 
distal orifices. The catheter was withdrawn in 1-cm 
increments into the esophagus. The data for each pa- 
tient represent the mean of 2 individual pull-throughs, 
measured from end-expiration gastric baseline to the 
highest end-expiration esophageal sphincter pressure. 
The normal resting pressure for the LES at our labo- 
ratory is 10 to 30 mm Hg. As the catheter was with- 
drawn, esophageal peristalsis was evaluated on the 
basis of the mean values of 10 wet swallows (5 mL) 
given at 30-second intervals. The UES pressure was 
monitored during 3 dry swallows. Thus, we deter- 
mined the distance between the LES and the UES and 
could choose a pH probe suitable for each patient. 
The esophageal manometry was performed after at 
least 4 hours of fasting. 


Ambulatory 24-Hour pH Monitoring. After ma- 
nometry, the subjects underwent ambulatory 24-hour 
dual-probe pH monitoring with 2 monocrystalline 
antimony pH probes positioned along a single cath- 
eter (Synectics Medical Inc; diameter, 2.1 mm). The 
2 recording sites were spaced individually and lo- 
cated 15 to 24 cm apart. A silver—silver chloride cu- 
taneous reference electrode was attached to the skin 
on the anterior part of the chest. Data were collected 
with a portable data logger (Digitrapper Mark III, 
Synectics Medical Inc) with a sampling rate of 4 sec- 
onds. Both probes were calibrated simultaneously in 
buffer solutions of pH 7 and pH 1 before each test. 
We placed the pH probes at 2 manometrically deter- 
mined sites: the proximal sensor was positioned 1 
cm proximal to the oral end of the UES, and the dis- 
tal sensor. 5 cm above the oral end of the LES. The 
probes were secured with tape to the nose. The sub- 
jects were encouraged to eat their regular meals with 
restrictions for intake of drink or food with a pH of 
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less than 4. They were also asked to abstain from 
smoking and drinking alcohol. All subjects kept a 
detailed diary indicating the time of oral intake, the 
changing of position (upright and supine), and symp- 
tomatic events such as heartburn or regurgitation. H2- 
receptor antagonists and proton-pump inhibitors were 
discontinued for a minimum of 10 days before the 
study. After the 24-hour monitoring period, the probe 
was removed and the data were transferred to a com- 
puter for the analysis with EsopHogram 5.70C2 
(Synectics Medical Inc). In addition, pH recordings 
were displayed on-screen for a detailed manual analy- 
sis and determination of the temporal relationship of 
the pH declines registered at the 2 probe sites. 


Definition of Reflux Episodes. For both sites, a de- 
crease in pH to less than 4 that was not induced by 
eating or drinking was considered the beginning of a 
reflux episode, and the following rise to a pH of great- 
er than 5 was considered the end of such an episode. 
For the pharyngeal probe, the decrease had to be 
abrupt and simultaneous with or preceded by a de- 
crease in pH of similar or larger magnitude in the 
esophageal probe to be accepted as a pharyngeal re- 
flux event. Determination of the temporal relation- 
ship between the onset of pH declines at both sites 
was performed. This differentiated pH declines in- 
duced by oral intake, aerodigestive tract residue and 
secretions, pharyngeal probe movement, or loss of 
mucosal contact in which the pharyngeal pH decline 
preceded esophageal pH drops from true EER epi- 
sodes. The information recorded on the data logger 
was compared with the information provided in the 
diary at the completion of the study. Calculation and 
evaluation of all recordings were done by one of the 
authors (R.Y.). 


The results of the ambulatory pH study regarding 
reflux to the distal esophagus were considered abnor- 
mal if the pH variables exceeded 2 SD from the mean 
derived from the asymptomatic normal subjects stud- 
ied in our laboratory. These values for esophageal 
monitoring were 7% upright, 3% supine, and 6.5% 
total. 


Using esophageal manometry and 24-hour pH 
monitoring, we determined 1) the LES resting tone 
and the sphincter’s responses to swallowing and the 
presence of esophageal dysmotility, 2) the number 
of pH declines to less than 4 in the distal esophagus 
and pharynx, 3) the percentage of esophageal reflux 
episodes that reached the pharynx, and 4) the acid 
exposure and average acid clearing times at the 2 
recording sites. The total acid exposure time was de- 
fined as the time during which the distal esophagus 
or pharynx was exposed to acid. The percent acid 
exposure time was calculated as the percentage of 


total time during which the pH probe was exposed 
to a pH of less than 4. The average acid clearance 
time was derived by dividing the total acid exposure 
time in minutes by the number of reflux episodes at 
that site. The data from the proximal and distal pH 
sensors taken during supine and upright periods were 
analyzed independently. Statistical comparison of the 
LES resting pressures between the groups was per- 
formed with the t-test, because the data were normally 
distributed. All other comparisons were performed 
with the nonparametric Mann-Whitney test because 
of skewed distributions of pH data. 


RESULTS 


Normative Data. Pharyngeal acid exposure events 
occurred in 5 of the 19 healthy controls: 3 subjects 
had 1 episode each, 1 subject had 2 episodes, and 1 
had 8 episodes during the 24-hour monitoring. All 
except 2 of these reflux episodes were registered in 
an upright position. 


Manometric Findings. All healthy volunteers and 
21 of the 26 patients with granuloma had normal 
esophageal peristalsis. Four granuloma patients with 
GERD had low-amplitude peristaltic pressure waves, 
and 1 had repetitive peristaltic contractions. The av- 
erage LES resting pressure was 20 mm Hg (SD, 7 
mm Hg) for the granuloma patients and 17 mm Hg 
(SD, 4 mm Hg) for the healthy controls. Both the 
granuloma patients and the healthy controls showed 
normal LES and UES relaxation in response to swal- 
lowing. 


pH Monitoring in Distal Esophagus. Acid episodes 
in the distal esophagus were identified in all sub- 
jects, indicating adequate registration. Half of the 
granuloma patients (13/26) had normal esophageal 
pH monitoring results, and the other half had patho- 
logical pH results. Of the 19 healthy volunteers, 3 
had esophageal pH recordings indicating increased 
acid reflux. The number of reflux episodes, total acid 
exposure time, percent acid exposure time, and acid 
clearance time showed considerable intersubject vari- 
ability in each group. A comparison between granu- 
loma patients and controls showed no significant dif- 
ferences in any esophageal reflux parameters, except 
in the percentage of distal reflux episodes that reached 
the pharynx (p = .006). Table 1 summarizes the pH 
data from the distal esophagus. 


pH Monitoring in Pharynx. Pharyngeal acid re- 
flux occurred in 17 of the 26 patients with granulo- 
ma. The number of pharyngeal reflux episodes in this 
group ranged from 1 to 20. However, of the healthy 
volunteers, 3 had 1 episode, 1 had 2 episodes, and 
another had 8 episodes of pharyngeal reflux. Of all of 
the pharyngeal reflux episodes, 93% occurred in the 
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TABLE 1. RESULTS OF ESOPHAGEAL 24-HOUR PH 
MONITORING OF GRANULOMA PATIENTS AND 


TABLE 2. RESULTS OF PHARYNGEAL 24-HOUR PH 
MONITORING OF GRANULOMA PATIENTS AND 























HEALTHY CONTROLS HEALTHY CONTROLS 
PH Variables Granuloma Healthy PH Variables Granuloma Healthy 
in Esophagus Patients Controls p in Pharynx Patients Controls p 
Total No. of Total No. of episodes 2 (0-18) 0 (0-8) .009 
episodes 48 (13-114) 38 (5-91) .168 Upright 2 (0-20) 0 (0-6) 006 
Upright 44 (13-103) 34 (5-87) .260 Supine 0 (0-4) 0 (0-2) 750 
Supine 3 (0-37) 3 (0-16) -267 Longest episode (min) 0.1 (0-3.3) 0 (0-0.7) 009 
Episodes >5 min 2 (0-14) 1 (0-10) .066 Total time pH < 4 
Longest episode A (min) 0.2 (0-9.6) 0.0 (0-1) .006 
(min) 14 (1-83) 5 (0.1-158) 093 % Time pH < 4 total 0.02 (0-0.9) 0.0 (0-0.1) 008 
Total time pH < 4 Acid clearance time 
(min) 63 (5-233) AL(1=388) 021 (min) 0.1 (0.1-2.6) 0.0 (0-0.7) 017 
Upright 55 (4-237) 25 (1-138) -129 Data are medians; 5th and 95th percentiles are given in parentheses. 
Supine 2 (0-141) 2 (0-308) 312 ~ i _ a 
% ksar pH<4 aiat yoan Pen 2) patients with normal distal esophageal monitor- 
A ee EEES ` ing results but reflux episodes to the pharynx (n = 
Upright 6.0 (0.6-27) 3.0 (0.06-17) .198 g . . . . 
: 9); 3) patients with abnormal distal esophageal moni- 
Supine 0.4 (0-35) 0.4 (0-38) .269 : è 
i toring results but no reflux episodes to the pharynx 
Total acid clearance Os anda ients with Peat 
time (min) 1.4 (0.3-4) 0.9 (0.1-6) 081 (n = 5); and 4) patients with abnormal monitoring 
Upright 1.1 (0.3-3) 0.8 (0.1-3) 115 results in the distal esophagus and reflux episodes to 
Supine 0.7 (0-22) 0.6 (0-19) 396 the pharynx (n = 8; Table 3). The 2 female patients 
% Episodes that with granuloma had normal pH monitoring results 
reached pharynx 6 (0-29) 0 (0-9) .006 in the distal esophagus, as well as in the pharynx. 
LES resting 
pressure DISCUSSION 
(mm Hg; mean : ; : P >a 
FSD) aaa 1744 192 The data in this study confirm that EER is more 


Except for lower esophageal sphincter (LES) resting pressure, data 
are medians; Sth and 95th percentiles are given in parentheses. 
“Episodes” — events in which pH < 4. 





upright position, and 3 subjects, 2 from the granulo- 
ma group and | healthy control, had pharyngeal re- 
flux episodes in both the upright and supine posi- 
tions. The parameters of pharyngeal acid reflux are 
shown in Table 2. Comparison between the groups 
showed that pharyngeal reflux episodes occurred sig- 
nificantly more often in the granuloma patients than 
in the control group (p = .009). The durations of the 
reflux episodes in the 5 healthy subjects ranged be- 
tween 6 and 12 seconds (mean, 9.0 seconds). The 
durations of the acid episodes in the granuloma group 
varied from 6 seconds to 3 minutes (mean, 29 sec- 
onds). In both groups, isolated pharyngeal pH de- 
clines were observed; also, anterograde pH declines 
from the pharynx to the distal esophagus were ob- 
served during oral intake. These episodes were disre- 
garded as reflux episodes. Of a total of 279 acid epi- 
sodes in the pharynx, 96 (34%) were accepted as re- 
flux episodes, and the rest, 183 episodes (66%), were 
considered artifactual. 


Based on our pH monitoring, 4 different subgroups 
of granuloma patients could be identified: 1) patients 
with normal 24-hour pH monitoring results (n = 4); 


common in granuloma patients than in asymptom- 
atic healthy subjects. The reflux episodes are typi- 
cally short and occur predominantly in an upright 
position. This kind of reflux pattern has previously 
been found in patients with subglottic stenosis and 
laryngeal cancer,!9 vocal nodules,!? and reflux lar- 
yngitis.!9.!* The short duration of the reflux episodes 
may explain the low incidence of classic reflux symp- 
toms in patients with contact granuloma. 


Early studies emphasized the role of mechanical 
trauma as an etiologic factor in granuloma forma- 
tion.!’ This trauma was especially marked at low 
speech fundamental frequencies and during cough- 


TABLE 3. BREAKDOWN OF PH STUDY RESULTS OF 
GRANULOMA PATIENTS VERSUS HEALTHY 
CONTROLS 








No. of No. of 
Granuloma Healthy 
Patients Controls 


PH Monitoring (n=26) (n=/9) 





Normal distal esophageal pH without 

reflux into pharynx 4 14 
Normal distal esophageal pH with reflux 

into pharynx 9 
Pathological distal esophageal pH 

without reflux into pharynx 5 0 
Pathological distal esophageal pH with 

reflux into pharynx 8 3 


N 











Ko 
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ing and clearing of the throat. Thus, voice therapy 
has traditionally been used as a treatment for contact 
granulomas. This treatment decreases hyperfunction, 
vocal fry, and monotony, and increases fundamental 
frequency in some patients,!® but it has not short- 
ened the healing time substantially.! Extraesophageal 
reflux may be a contributory factor in the persistence 
of granuloma, even though the cause-and-effect re- 
lationship between reflux and granuloma has not been 
established in this study. Controlled studies on anti- 
reflux treatment in granuloma patients are to be per- 
formed. 


We did not find any significant differences between 
the groups with manometry. The LES resting pres- 
sures were almost similar in both investigated groups, 
and most of the subjects had normal esophageal peri- 
stalsis. Thus, esophageal manometry does not seem 
to be useful in the diagnosis of EER. However, the 
pressure of the UES or the transient relaxations of 
the UES, which could not be evaluated by our 
method, may give us more information in the future 
about the pathophysiology of pharyngeal reflux. 


Another predominant finding in this study was that 
a higher percentage of esophageal acid episodes 
reached the pharynx in the granuloma group than in 
the control group. There was no statistically signifi- 
cant difference in the number of distal reflux episodes 
in an upright position between these groups, and the 
majority of pharyngeal acid episodes occurred only 


‘in an upright position. Also of particular interest are 


the 9 patients who had normal esophageal monitoring 
results but had pharyngeal reflux associated with the 
physiological esophageal reflux episodes. This find- 
ing suggests that granuloma patients may have ab- 
normalities in the mechanism that normally prevents 
EER. Ulualp et al!? have shown that stimulation of 
the human pharyngeal mucosa by irrigation of minute 
amounts of water stimulates the pharyngo-UES con- 
tractile reflex. They suggest that this reflex may be 
activated during pharyngeal reflux of gastric acid, 
thus increasing the UES pressure. It could be con- 
sidered as one of the protective mechanisms against 
contact of acid refluxate with the hypopharyngeal 
and laryngeal structures. In the same study, they 
showed that the volume of liquid required to trigger 
this reflex is significantly larger in patients with pos- 
terior laryngitis than in healthy controls. Their find- 
ings suggest an altered afferent sensory limb of this 
reflex in patients with posterior laryngitis. Another 
explanation for EER may be that a forceful acid re- 
flux event causes abrupt and complete relaxation of 
the UES.°.21 Because it occurs independently of 
swallowing, the larynx is not elevated, the laryngeal 
outlet is open, and the posterior glottis is in direct 
contact with the refluxate. However, these mecha- 


nisms have not been studied in patients with contact 
granuloma. 


The selection of healthy controls is important in 
distinguishing between normal and abnormal values 
in reflux studies. Symptomatic GER is a common 
disorder; an estimated 7% of adults experience heart- 
burn on a daily basis.22 In our study, 19 healthy vol- 
unteers denied heartburn and acid regurgitation, but 
3 of them had pathological pH recordings from the 
distal esophagus and reflux episodes to the pharynx. 
Stål et al?3 have shown that hiatus hernia is a com- 
mon finding in healthy subjects and that it predis- 
poses them to GER. In addition, in their study, symp- 
tom evaluation was not sufficient to exclude GER in 
healthy subjects, and therefore, esophageal abnor- 
malities should be excluded by esophagogastroscopy. 
Regrettably, the subjects in our study were not exam- 
ined by endoscopy. Moreover, our healthy controls 
were on average 20 years younger (mean age, 32 
years) than the patients (mean age, 52 years). How- 
ever, the impact of age on the degree of distal esopha- 
geal reflux differs among previous studies.24.25 Ac- 
cording to Richter et al,?® there is a trend toward in- 
creased reflux with age, but they concluded that for 
clinical purposes, age did not have an important ef- 
fect on the physiological parameters of acid reflux. 
Pharyngeal reflux should, of course, be studied in this 
respect. 


There are sparse normative data concerning pha- 
ryngeal reflux events. Koufman!® had earlier reported 
that none of 12 normal subjects had any evidence of 
pharyngeal reflux. Shaker et al!? studied 12 healthy 
volunteers, 2 of whom had 1 reflux episode each. In 
the current study, 3 of the healthy volunteers had 1 
pharyngeal reflux episode each, 1 had 2 episodes, 
and 1 had 8 episodes during the 24-hour registration. 
The overall number of esophageal reflux episodes 
has previously proved to be poorly reproducible. !4 
Probably, pharyngeal reflux may also be intermittent, 
and day-to-day variability of these reflux events may 
account for the discrepancy. That 1 healthy control 
had 8 pharyngeal reflux events and also pathological 
esophageal pH monitoring results may suggest that 
the mere entry of gastric contents into the pharynx 
does not necessarily result in mucosal injury and de- 
velopment of laryngeal abnormalities. 


In summary, increased EER could be detected in 
the granuloma group. Furthermore, the percentage 
of distal esophageal reflux episodes that reached the 
pharynx was significantly higher in granuloma pa- 
tients than in asymptomatic control subjects. In 
healthy subjects, there were episodes in which re- 
fluxed gastric acid reached the pharynx, but they were 
uncommon. 
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EXTRACAPSULAR SPREAD IN IPSILATERAL NECK AND 
CONTRALATERAL NECK METASTASES IN LARYNGEAL CANCER 


YUSUFHAN SUOGLU, MD 
BURAK ERDAMAR, MD O. SAMI KATIRCIOGLU, MD 


M. CAN KARATAY, MD TAYFUN SUNAY, MD 
ISTANBUL, TURKEY 


We investigated the incidence of extracapsular spread (ECS) and the impact of ECS on contralateral neck metastasis in 67 
patients with ipsilateral nodal metastasis (IpN+) whose records were extracted retrospectively from those of 155 laryngeal cancer 
patients. The incidence of ECS in association with variables was determined: T stage, N stage, tumor location, tumor extension, 
number of positive nodes, and contralateral neck status. The variables were evaluated to identify their impact on the rates of con- 
tralateral neck metastasis (CNM) and 3-year survival. Of the 67 patients, 30 (44.7%) had ECS. A significant relationship was found 
between ECS positivity and increased N stage, tumor extension up to the midline, number of positive nodes, and CNM (p = .04, p = 
.0001, p = .018, p = .0001, respectively). Multivariate analysis revealed that N stage (p = .002; odds ratio, 3.5517) and the presence 
of ECS (p = .0036; odds ratio, 7.7840) in IpN+ were associated with the greatest risk of CNM. The 3-year survival rate of patients 
with ipsilateral ECS was significantly lower than that of patients without ECS (43% versus 81%, p = .0002). Both CNM and presence 
of ECS in IpN+ emerged as significant independent predictors for survival with Cox multivariate analysis (p = .0086 and p = .0234, 
respectively). This result indicates the necessity of treating the contralateral NO neck in cases of IpN+ with ECS. 


KEY WORDS — extracapsular spread, laryngeal cancer, lymph node metastasis, prognostic factor. 


INTRODUCTION patients with ipsilateral nodal metastasis (IpN+) is 

In laryngeal cancer patients, evaluation of the neck controversial. To our knowledge, no study has been 
is important for staging of the disease, as it is in all conducted on the influence of ECS involvement in 
other cases of head and neck squamous cell carcino- IpN+ on occult, concurrent, or late contralateral neck 
ma. In addition to the site of the primary tumor, the disease. 
extension of the tumor, and the infiltration pattern, The present study was designed to illuminate 1) 
the presence of cervical lymphatic metastasis sug- the incidence of ECS associated with variables such 
gests the treatment regimen and may predict the prog- as T stage, N stage, tumor location within the larynx, 
nosis. However, neck status is the most important tumor extension according to midline, number of met- 
prognostic factor. Many variables have been impli- astatic nodes, and contralateral neck status; and 2) 
cated in the prognosis of laryngeal carcinoma. Among the impact of the variables together with the presence 
these, cervical lymphatic status is the variable with of ECS on contralateral neck disease in patients with 
the greatest prognostic significance in both univariate advanced laryngeal carcinoma. 
and multivariate studies. !-? The 5-year survival peri- 
od is decreased by 50% in cases with lymph node MATERIALS AND METHODS 
metastasis, compared to those without nodal involve- We reviewed 189 patients (175 men, 14 women; 
ment.|2.8,.9 Furthermore, histopathologic character- median age, 55.9 years; range, 34 to 75 years) with 
istics of metastatic lymph nodes affecting prognosis, previously untreated laryngeal carcinoma who re- 
such as size, quantity, fixation to surrounding tissue, ceived diagnosis and treatment in the Department of 
and presence of extracapsular spread (ECS), have Otorhinolaryngology, Istanbul School of Medicine, 
been investigated in previous studies. Histopathologi- over the 3-year period from June 1992 to June 1995. 
cally, ECS is defined as penetration of tumor through The medical charts of the patients with laryngectomy 
the capsule of involved lymph nodes. The presence (partial or total) and neck dissection completed by 
of ECS in 1 or more metastatic nodes has been found June 1998 were reviewed retrospectively. Only the 
by some to be the most important prognostic param- patients who had completed 3 years of follow-up were 
eter.45,10-14 However, other investigators have failed included in the study. Patients who died of intercur- 
to show a direct correlation between ECS involve- rent diseases without evidence of local-regional re- 
ment and prognosis.!>-!6 On the other hand, manage- current disease were excluded from all analyses. Pa- 
ment of the contralateral neck in laryngeal cancer tients with fixed lymph nodes, with an N3 neck, with 
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TABLE 1. TNM STAGING AFTER INITIAL OPERATION 
ACCORDING TO UICC (1987)!7 











Nodal Stage T1 T2 T3 T4 Total 
NI 0 13 19 4 36 
N2a 0 4 8 4 16 
N2b 0 4 8 3 15 
Total 0 21 35 o H 67 





local-regional recurrences or distant metastasis at 
presentation, or who were deceased or lost to follow- 
up were excluded. Among 155 patients, 67 patients 
with histologic ipsilateral nodal involvement who had 
undergone bilateral neck dissection were included 
in the study. Neck dissection on the clinically NO 
contralateral side of the neck had been carried out in 
selected cases as a prophylactic procedure 3 weeks 
after the initial operation because of tumor extension, 
an ipsilateral N2 neck, or the suspicion of occult me- 
tastasis on the contralateral side of the neck because 
of the advanced tumor stage. We defined contralateral 
neck metastasis (CNM), which is actually an occult 
metastasis, as a clinically NO contralateral neck that 
was found to have nodal involvement at elective con- 
tralateral neck dissection in a patient with IpN+. Thus, 
to constitute a homogeneous CNM group, we ex- 
cluded the patients with clinically N+ necks and those 
with delayed CNM from the analysis. 


All surgical margins were negative on primary sur- 
gery as confirmed by routine frozen-section histo- 
logic examination. Modified radical neck dissection 
had been performed in all cases. Postoperative radio- 
therapy consisted of 50 to 65 Gy (median, 61 Gy) 
administered to metastatic necks with ECS after ini- 
tial treatment. Patients who had CNM also received 
postoperative radiotherapy. 


The clinical and pathological records were enrolled 
in statistical analysis to define the following variables: 
age and sex of the patients, tumor location, periopera- 
tive T stage and pathologic N stage (according to 
initial neck dissection), number of nodes with posi- 
tive histology, tumor extension (whether it crossed 
the midline or not), presence or absence of ECS in 
IpN+, and contralateral neck status at the time of pri- 
mary surgical treatment. The 1987 UICC version of 
the TNM classification was used (Table 1).!7 We ana- 
lyzed the correlation between variables and the inci- 
dence of ECS in the ipsilateral neck, as well as other 
variables, including ECS in IpN+, that could have a 
potential effect on CNM. 


The SPSS statistical software package (version 9.0, 
1998) was used to analyze the results. In ipsilateral 
positive necks, the incidence of ECS in each T stage, 
N stage, tumor location, tumor extension in the 
larynx, and number of metastatic nodes and in pa- 


tients with CNM was analyzed with cross-tabulations 
(x? test, Fisher’s exact test) and unpaired f-tests when 
appropriate. Similarly, the incidence of CNM was 
determined according to these variables, including 
ECS. The variables under study were evaluated for 
ECS incidence, and their predictive value with re- 
spect to CNM was determined by means of the step- 
wise logistic regression analysis. Survival curves were 
calculated with the Kaplan-Meier model. The impacts 
of the variables such as T stage, N stage, tumor ex- 
tension, tumor location, positive node number, ECS 
positivity, and CNM on survival were assessed for 
significance with the log-rank test. The Cox multiple 
regression model was used to determine which vari- 
ables had an independent effect on survival, with the 
variables being introduced stepwise in the order 
established by probability. The variable with the high- 
est likelihood of the preceding parameter was in- 
cluded. The process continued until no further vari- 
ables could be added at a significance level of .05.!8 


RESULTS 


Among the 155 patients who fulfilled the inclusion 
criteria for the study, 88 patients had pNO status, 47 
had sole IpN+, and 20 had bilateral neck disease of 
which the CNM was occult at initial surgery. Because 
we aimed to investigate the impact of ECS on CNM 
in addition to determining the ECS incidence at IpN+ 
and its relationship with other variables, we performed 
Statistical analysis on the IpN+ group (n = 67). In 
order to provide a uniform CNM patient group and 
to better analyze the true impact of ECS on CNM, 
we selected only patients with histologically proven 
CNM at elective contralateral neck dissection as the 
CNM group. 


Among the neck specimens of these 67 IpN+ pa- 
tients, 30 (44.7%) had ECS in ipsilateral metastatic 
lymph nodes (Table 2). When the 67 patients were 
grouped according to T stage, 21 patients were T2 
(31.3%), 35 were T3 (52.2%), and 11 were T4 
(16.4%). We found ECS in 42.8%, 40%, and 63.6% 
of T2, T3, and T4 tumors, respectively. No significant 
correlation existed between T stage and the presence 
of ECS in IpN+ (p =.124). A significant relationship 
was found between the presence of ECS and N stage 
(p = .041). The incidence of ECS increased with ad- 
vanced N stage, from 36.1% in N1 to 73.3% in N2b. 
In the supraglottic and transglottic tumors that con- 
stituted the study group, ECS was present in 43.4% 
and 45.4% of cases, respectively. No significant dif- 
ference was observed with regard to the presence of 
ECS in these two anatomic locations (p > .05). Con- 
sidering the ECS incidence in the tumor extension 
with respect to the midline (Table 2), we found that 
ECS was present in 20 cases (74%) with lesions ex- 
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TABLE 2. UNIVARIATE ANALYSIS OF RELATIONSHIP 
BETWEEN VARIABLES AND PRESENCE OF ECS 


TABLE 4. UNIVARIATE ANALYSIS OF RELATIONSHIP 
BETWEEN VARIABLES AND CNM 











Patients 
With IECS+ 
Variables IpN+ No. % p 
T NS* 
1 0 0 0 
2 21 9 42.8 
3 35 14 40.0 
4 11 7 63.6 
Total 67 30 44.7 
N .041* 
1 36 13 36.1 
2a 16 6 37.5 
2b 15 11 73.3 
Supraglottic 23 10 43.4 NSt 
Transglottic 44 20 45.4 
Extension .0001f 
Beyond midline 27 20 74.0 
Unilateral 40 10 25.0 
No. of metastatic nodes .018§ 
1 48 17 35.4 
2 6 4 66.6 
23 13 9 69.2 
CNM .0001+ 
Yes 20 16 77.2 
No 47 14 28.8 


ECS — extracapsular spread; IpN+ — ipsilateral positive node(s); 
JECS — ipsilateral positive node(s) with extracapsular spread; NS 
— not significant; CNM — contralateral neck metastasis. 

*y2 Test. 

tFisher’s exact test. 

§t-Test. 





tending beyond the midline (n = 27). In cases with 
unilateral lesions (n = 40), this figure was 25%. La- 
ryngeal lesions extending beyond the midline had a 
significantly higher incidence of ECS than did uni- 
lateral laryngeal lesions (p = .0001). With regard to 
the association between the number of metastatic 
nodes and ECS involvement, we found a higher ECS 
incidence in neck metastasis with 3 or more positive 
nodes (69.2%). The relationship between the number 
of positive nodes and the presence of ECS was deter- 
mined to be significant (p = .018). Comparing the 
presence of ECS at the initial ipsilateral neck dissec- 
tion of patients who had CNM with that of patients 
who had ipsilateral neck disease alone, the ratio was 
77.2% (16 of 22) versus 28.8% (14 of 45; p = .0001). 
Multivariate logistic regression revealed that the most 
significant association with ECS existed in patients 


TABLE 3. MULTIVARIATE LOGISTIC REGRESSION 
ANALYSIS OF ECS INCIDENCE WITHIN VARIABLES 











Variables R Odds Ratio p 
Extension —1.7607 1719 .004 
CNM . 1.7930 6.0075 .009 








CNM 
Variables Patients No. % p 
T NS* 
1 0 0 
2 21 4 19.0 
3 35 12 34,2 
4 11 4 36.3 
Total 67 20 29.8 
N . .0001* 
1 36 3 8.3 
2a 16 8 50.0 
2b 15 9 60.0 
Supraglottic 23 6 26.0 NSt 
Transglottic 44 14 31.8 
Extension .002T 
Beyond midline 27 16 59.2 
Unilateral 40 4 10.0 
No. of metastatic nodes .0001§ 
1 48 9 18.7 
2 6 2 33.3 
23 13 9 69.2 
IECS 0001F 
Yes 30 17 56.6 
No 37 5 13.5 
*y2 Test. 
+Fisher’s exact test. 
§t-Test. 





with tumors extending beyond the laryngeal midline, 
and the second most significant was in those with 
CNM (Table 3). 


Of the 67 patients who had elective neck dissection 
on the contralateral neck side, occult CNM was dis- 
covered in 20 by histologic examination. Thus, the 
CNM ratio was found to be 32.8% among the pa- 
tients with IpN+, and these patients constituted our 
main group for investigating impact factors in this 
study. Nodal involvement on the contralateral neck 
side was discovered during initial treatment (3 weeks 
after previous neck dissection) in this group. Table 4 
illustrates the relationship between variables and 
CNM in IpN+ patients. The variables N stage, exten- 
sion of the tumor across the midline, number of posi- 
tive nodes, and presence of ECS showed a statistically 
significant effect on CNM (Table 4). In this analysis, 
no significant differences were found for age, gender, 
tumor location in the larynx, and T stage. The analysis 
of nodal status in all patients demonstrated a signifi- 
cantly increased risk of CNM with increasing N stage 
(p = .002). As shown in Table 4, CNM developed in 
8.3% of N1 cases, 50% of N2a cases, and 60% of 
N2b cases. Sixteen of the 20 patients with bilateral 
neck disease had ECS on the ipsilateral side of the 
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TABLE 5. MULTIVARIATE LOGISTIC REGRESSION 
ANALYSIS OF RELATIONSHIP BETWEEN VARIABLES 











o AND CNM E 
Variables R Odds Ratio p 

N stage 1.2674 3.5517 .0020 
IECS 2.0521 7.7840 0036 





neck; 12 (60%) of the contralateral neck specimens 
from these patients also had ECS. Patients with histo- 
logically proven CNM have a significantly higher 
ECS incidence on the ipsilateral side of the neck than 
do those with ipsilateral neck disease alone (p= 
.0001). On the other hand, the risk of CNM was sta- 
tistically higher for patients with positive nodes with 
ECS on the ipsilateral side of the neck than for those 
with positive nodes without ECS (17 of 30 or 56.6% 
versus 5 of 37 or 13.5%; p = .0001). Considering the 
tumor extension in the larynx with respect to the la- 
ryngeal midline, we found that tumors extending to 
the midline had a significantly higher risk of CNM 
(16 of 27 or 59.2% versus 4 of 40 or 10%). Analo- 
gously, it was noticed that most patients had laryngeal 
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tumors extending beyond the midline in the bilateral 
neck disease group (16 of 20 or 80%). In other words, 
the risk of CNM was statistically lower for patients 
with unilateral laryngeal lesions than for those with 
tumors extending over the midline (p = .002). The 
number of nodes with ECS correlated with an in- 
creased probability of CNM (p = .004). The inci- 
dence of CNM was 18.7% in patients with 1 histo- 
logically positive node; it was 33.3% in patients with 
2 and 69.2% in patients with 3 or more positive lymph 
nodes in ipsilateral neck specimens. The rate of CNM 
was significantly higher in patients with 3 or more 
metastatic nodes than in the others (p = .0001). 

A multivariate stepwise logistic regression analysis 
adjusting for the variables mentioned above revealed 
that N stage (p = .002; odds ratio, 3.5517) and the 
presence of ECS (p = .0036; odds ratio, 7.7840) on 
the ipsilateral neck side were associated with the 
greatest risk of CNM (Table 5). 


By the end of the follow-up period, 25 patients 
(37.3%) had died of cancer. The causes of death were 


Disease-specific 3-year survival. A) Overall. B) Accord- 
ing to contralateral neck metastasis (log-rank analysis, 
p = .0001). C) According to presence of extracapsular 
spread (ECS) in ipsilateral positive node(s) (log-rank 
analysis, p = .0002). 
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TABLE 6. UNIVARIATE ANALYSIS OF VARIABLES 
FOR 3-YEAR SURVIVAL 











Variables No. of Patients Survival (%) p* 
T .0478 
2 22 82 
3 35 57 
4 10 40 
N .0018 
1 36 81 
2a 16 50 
2b 15 33 
ECS .0002 
Yes 30 43 
No 37 81 
Extension .0259 
Beyond midline 27 48 
Unilateral 40 73 
No. of metastatic nodes .0608 
1 48o 7 
2 6 50 
23 13 38 
CNM 0001 
Yes 20 30 
No 47 76 


*Log-rank analysis. 


as follows: ipsilateral neck recurrence in 6 patients, 
delayed contralateral neck recurrence in 5, distant 
metastasis in 7, local recurrences in 3, and another 
neoplasm in 4. Eight patients who developed ipsi- 
lateral or contralateral recurrences had ECS-positive 
nodes on the previously dissected neck side. Two of 
the patients with distant metastasis had simultaneous 
contralateral neck recurrences, and all had IpN+ and 
5 had ECS on the ipsilateral neck side. Of the 25 
patients who died of cancer, 18 (60%) had ECS-posi- 
tive nodes. Of the remaining 42 patients who sur- 
vived, 12 (28.5%) had ECS. 


The disease-specific survival rate was 64.1% at 3 
years (see Figure, A). The relationship between the 
variables and survival is shown in Table 6. The 3- 
year survival rate of patients with ipsilateral ECS was 
significantly lower than that of patients without ECS 
(43% versus 81%, p = .0002; see Figure, C). Whereas 
the survival rate was 30% for patients who had CNM, 
this rate was 76% in patients with ipsilateral neck 
disease alone (p = .0001; see Figure, B). Contralateral 
neck metastasis and the presence of ECS at ipsilateral 
nodes emerged as significant independent predictors 
on Cox multivariate analysis, with p = .0086 and p= 
.0234, respectively (Table 7). 


DISCUSSION 


In the surgical treatment of laryngeal cancer, neck 
dissection is performed as a measure supplementary 
to the primary tumor resection in cases with possible 


TABLE 7. MULTIVARIATE (COX REGRESSION) 
ANALYSIS OF RELATIONSHIP BETWEEN VARIABLES 











AND SURVIVAL 
Variables R Odds Ratio p 
CNM 1.1494 3.1563 .0086 
IECS 1.0901 2.9746 .0234 





cervical spread or in those with a clinically palpable 
cervical node. According to the TNM system, nodal 
involvement is the most important factor in estimat- 
ing survival. Nodal histopathologic factors, including 
the number and size of the positive nodes, the cervical 
level, and ECS involvement, have prognostic signif- 
icance in terms of increased neck recurrences, de- 
creased survival, and distant metastasis.”-8-11-19-21 In 
addition to the number and size of metastatic nodes, 
the importance of histopathologic assessment of the 
ECS has been stressed in several articles.3.4.6.11,22-24 
In occult nodal metastasis, clinically classified as NO, 
ECS was observed in 49% in the study of Alvi and 
Jobnson,? and it contributed to an unfavorable prog- 
nosis. Snyderman et al25 found ECS in 20% of NO 
cases; the rate increased in parallel to N stage. Extra- 
capsular spread is reported to exist in 20% to 43% of 
involved nodes smaller than 1 cm in diameter.3+!! 
In the relevant literature, the reported ECS ratios in 
different series range from 20% to 80%.245,11,25-27 
In particular, the presence of ECS in positive nodes 
is an unfavorable prognostic sign associated with an 
increased frequency of nodal recurrences and a short- 
ened survival.>-9.28 


Neck metastasis of laryngeal cancer limited within 
the hemilarynx tends to occur on the ipsilateral neck 
side, and it is generally believed that contralateral or 
bilateral neck metastasis develops in tumors located 
in the midline.2? Marks et al,3° however, reported 
that contralateral involvement is more probably re- 
lated to ipsilateral cervical spread. In that study, only 
a minority (7%) of the cases with CNM had central 
supraglottic tumors located in the false vocal fold, 
suprahyoid region, or infrahyoid region. Thus, CNM 
of laterally limited lesions was related to IpN+.2° In 
the case of tumor location in the midline structure or 
extension to the other side, the contralateral side of 
the neck is at risk of regional metastasis. Contralateral 
neck metastasis from unilateral laryngeal cancer is 
diagnosed at complementary neck dissection of the 
other side or when it becomes manifest during follow- 
up. Apart from the size of the primary tumor and its 
extension, any IpN+ lesion has a possible risk for 
CNM, particularly if the metastatic nodes in the ipsi- 
Jateral neck are multiple and large.31.32 


Levendag et al?3 reported that 78% of neck relapses 
were in the opposite, undissected side of the neck in 
patients with squamous cell carcinoma. Gallo et al*4 
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found that 38.5% of patients with supraglottic cancer 
had bilateral neck disease, irrespective of elective or 
delayed treatment of the previously negative contra- 
lateral side of the neck. Lutz et al35 found a 20% rate 
of recurrent neck disease in supraglottic cancer pa- 
tients. Of these, 75% had metastasis on the contra- 
lateral or undissected side of the neck.35 The finding 
of Pitman et al*° that 38% of their patients had T2 
lesions suggests that advanced primary disease is not 
the only source of regional recurrence. The regional 
recurrence rates in patients with pNO versus pN+ 
were found to be statistically different (38% versus 
12.9%, p < .02). Moreover, ECS positivity was almost 
significantly higher in the pN+ group (p = .06).36 
Similarly, Gallo et al reported that the presence of 
ECS on the ipsilateral side of the neck correlated with 
an overall higher, although not significantly so, risk 
of CNM. However, Myers and Alvi37 noted CNM 
and high rates of ECS on the dissected side of the 
neck at initial surgery in their series and stressed the 
importance of bilateral neck dissection to decrease 
the incidence of regional recurrence. In the report of 
Kowalski et al,?8 the presence of metastatic nodes 
with ECS was found to be a significant indicator of 
the risk of neck recurrence and CNM. 


Although this retrospective study was designed to 
determine the incidence of ECS in metastatic lymph 
nodes and its correlation with T stage, N stage, num- 
ber of metastatic nodes, site of the tumor, and exten- 
sion of the lesion, we recognized that ECS positivity 
on the ipsilateral neck side might also have an impact 
on CNM. In this context, the influence of histologic 
ECS in IpN+ neck specimens on CNM was investi- 
gated along with the variables named above. Thus, 
we evaluated our cases in order to assess ECS in ipsi- 
lateral metastatic lymph nodes, as well as the relation- 
ship between ECS positivity and nodal metastasis in 
contralateral necks that were considered clinically NO. 
In our study, the CNM ratio was found to be 29.8% 
in patients with advanced laryngeal cancer. Tumor 
extension beyond the midline, increasing N stage, 
ECS positivity, and more than 3 ipsilateral metastatic 
nodes were shown to be significant risk factors for 
CNM. However, in multivariate analysis, N stage and 
ECS positivity were the variables that independently 
correlated with the greatest risk of CNM. Thus, we 
suggest that ipsilateral ECS contributes to a poor prog- 
nosis by means of increasing the CNM incidence. 
Although IpN+ with ECS is suggested to be an unfav- 
orable prognostic factor in many studies, we could 
not find a comprehensive study involving the impact 
of ipsilateral ECS on CNM. In our study comprising 
155 cases, ipsilateral ECS was observed in 30 of 67 
pN+ cases (48.3%). When the presence of ipsilateral 
ECS was evaluated according to contralateral neck 


Status, ipsilateral ECS was observed in 77.2% of 20 
cases with CNM, whereas it was present in 28.8% of 
cases with ipsilateral nodal involvement alone (p= 
.0001; Table 2). In other words, there existed a signifi- 
cant relationship between the presence of ECS in ipsi- 
lateral cervical metastasis and CNM. Furthermore, 
ECS was present in 60% of the contralateral neck 
specimens. In laryngeal carcinoma patients with ip- 
silateral metastases, the number of positive lymph 
nodes is reported to be an important prognostic fac- 
tor. 1:3940 In the present study, a statistically significant 
association was found between the number of posi- 
tive lymph nodes and the presence of CNM in pa- 
tients with IpN+ (p = .0001). 

In our series, 11 of the 67 patients had regional 
recurrences; 4 of them also had distant metastasis. 
In 8 of the 11 cases with regional recurrences, ECS 
had already been detected during the previous neck 
dissection. Of 7 patients who developed distant me- 
tastasis, all had ipsilateral neck disease, and 5 had 
ECS as well. Because only patients with IpN+ with 
ECS and histologically proven CNM received post- 
operative radiotherapy, we could not evaluate the im- 
pact of adjuvant irradiation on survival in our series. 

The impact of ECS involvement on the contralater- 
al side of the neck in patients with nodal metastasis 
is shown in Table 4. Likewise, 43% of patients with 
ipsilateral nodal metastasis with ECS survived 3 
years, in contrast to 81% of patients with ipsilateral 
nodal metastasis without ECS (p = .0002). Although 
increased N stage, tumor extension beyond the mid- 
line, T stage, and the number of positive nodes have 
shown significant correlation with survival, a Cox 
regression analysis revealed a trend toward decreased 
survival among patients with CNM and ECS positiv- 
ity (Table 7). 

In laryngeal carcinoma patients, in whom the prog- 
nosis is largely influenced by nodal involvement, 
elective management of the NO neck concordant with 
the primary treatment method is mandatory. Depend- 
ing on the clinical and pathological features of the 
primary tumor and the ipsilateral neck, elective man- 
agement of the contralateral NO neck can also be man- 
datory. The histologic features, the number and size 
of involved lymph nodes obtained by neck dissec- 
tion, and the presence of ECS are important deter- 
mining factors for adjunctive therapy. Chemother- 
apy and radiotherapy in addition to surgical treatment 
are reported to be beneficial in patients with ECS.4! 
In cases of IpN+ with ECS, the probability of the 
occurrence of CNM was found to be higher than in 
N+ cases without capsule invasion. This finding 
stresses the importance of contralateral neck dissec- 
tion for diagnosis and treatment. The indication for 
elective neck dissection must not rely solely on the 





A 
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basis of the site of the primary lesion and the his- 
tologic grade. The risk of delayed CNM is a major 
cause for concern; the ipsilateral neck dissection 
specimen could suggest the need for comprehensive 
neck dissection of the other side when a satisfactory 
clinical assessment cannot be done or when socio- 
economic factors will not permit regular follow-up. 
In other words, in the case of an IpN+ neck with 





ECS, it is advisable to perform a comprehensive neck 
dissection on the other side rather than waiting for 
neck disease to be manifested. 


In conclusion, we suggest that the presence of ECS 
in ipsilateral regional lymph nodes of patients with 
laryngeal carcinoma can be used as an indicator in 
managing the contralateral side of the neck. 
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EFFECTS OF ADDING STEROIDS, IN VITRO IRRADIATION, OR BOTH 
TO CYCLOSPORINE IMMUNOSUPPRESSION IN THE MURINE 
LARYNGEAL TRANSPLANTATION MODEL 
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Adding oral prednisone or in vitro organ irradiation to an immunosuppressive regimen for laryngeal transplantation may allow 
cyclosporine dosage to be decreased without compromising organ rejection rates. Using an established rat laryngeal transplant 
model, we sought to determine whether combined immunosuppressive regimens decreased rejection rates. Twenty-two treatment 
groups of 10 animals each were studied. Cyclosporine was administered at 2 different historically low dosages (1.5 and 2 mg/kg per 
day) alone, in combination with prednisone, and with in vitro irradiation of the harvested larynx. At 15 days after transplantation, 
cyclosporine combined with prednisone did not significantly alter the mean severity of rejection. At 30 days after transplantation, 
rats treated with a combination of cyclosporine 2 mg/kg and prednisone 1 mg/kg showed the least amount of rejection. Rejection 
rates were significantly higher with 2 mg/kg cyclosporine alone (p < .00024). Irradiation did not significantly decrease the severity of 
rejection. These results demonstrate that at 1 month after transplantation, a combined steroid and cyclosporine regimen reduces the 
severity of rejection after laryngeal transplantation in an animal model. Our findings suggest that laryngeal transplant patients may 
be treated with low cyclosporine doses without experiencing organ rejection if steroids are added to the immunosuppressive regimen. 


KEY WORDS — cyclosporine, larynx, muridae, prednisone, radiation, steroids, trachea, transplantation, voice. 


INTRODUCTION 


In 1972, Borel et al demonstrated that cyclosporin 
A, an extract from the fungus Tolypocladium infla- 
tum Gams, has immunosuppressive properties.! Six 
years later, cyclosporine was used for the first time 
to treat a renal transplant recipient, and it since has 
remained a cornerstone of post-transplantation immu- 
notherapy.? Whereas azathioprine, the prototypic im- 
munosuppressive drug, inhibits cell division of all 
elements of the immune system, cyclosporine selec- 
tively alters adaptive immune responses by inhibit- 
ing activated T-lymphocytes and preventing them 
from manufacturing or releasing interleukin-2.34 


Unfortunately, cyclosporine is not without toxici- 
ties, which include hepatotoxicity, neurologic effects 
such as tremors, paresthesias, muscle weakness, and 
seizures, and the most common side effect, nephro- 
toxicity.> In almost all patients, cyclosporine causes 
at least a 20% reduction in renal function, which can 
progress to fulminant renal failure. Nephrotoxicity 
can occur at any concentration, but is less frequent 
with lower doses.’ To reduce doses of cyclosporine 
while maintaining immunosuppression, steroids have 
been added to the post-transplantation regimen. In 


renal transplant patients, Starzl et al®:? demonstrated 
that adding steroids to the immunosuppression regi- 
men allowed cyclosporine doses to be lowered, re- 
ducing the severity of nephrotoxicity without increas- 
ing rejection rates. In cardiac transplantation, com- 
bining low-dose steroids with cyclosporine substan- 
tially decreased the incidence and severity of rejec- 
tion episodes.1° 


The first successful human laryngeal transplanta- 
tion was performed by a team led by the senior au- 
thor (M.S.) in January 1998, and the patient contin- 
ues to function well. By adding prednisone to the 
standard cyclosporine immunosuppressive regimen, 
we were able to reduce the cyclosporine dosage with- 
out signs of rejection. To determine whether these 
results could be generalized, we conducted this study 
to investigate the efficacy of combination immuno- 
suppressive regimens in a rat model that has been 
used to study cyclosporine dosing levels, adminis- 
tration routes, and a variety of immunosuppression 
variables since 1993.11-13 


We hypothesized that combining prednisone with 
cyclosporine therapy could allow the cyclosporine 
dose to be decreased without compromising immu- 
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nosuppression. In addition, we wanted to determine 
whether in vitro irradiation would enhance the im- 
munosuppressive effect of the cyclosporine-steroid 
combination regimens, as it does with cyclosporine 
alone.!3 Our prospective, controlled trial contained 
22 treatment arms of 10 animals each. The variables 
were levels of cyclosporine, levels of prednisone, ad- 
ministration of irradiation to the larynges in vitro be- 





IMMUNOSUPPRESSIVE REGIMENS FOR GROUPS | 
In Vitro 
Irradiation 





Cyclosporine Prednisone 
(mg/kg (mg/kg 





Group Name per day) per day) (cGy) 

15-day groups 
Cy2/15d 2.0 0 0 
Cy1.5/15d 1.5 0 0 
P1/15d 0 1.0 0 
P0.5/15d 0 0.5 0 
P0.25/15d 0 0.25 0 
PO.1/15d 0 0.1 0 
Cy2/P1/15d 2.0 1.0 0 
Cy2/P0.5/15d 2.0 0.5 0 
Cy2/P0.25/15d 2.0 0.25 0 
Cy1.5/P1/15d L5 1.0 0 
Cy1.5/P0.5/15d 1:5 0.5 0 
Cy 1.5/P0.25/15d 1.5 0.25 0 
Cy2/P1/X/15d 2.0 1.0 800 
Cy1.5/P1/X/15d 1.5 1.0 800 
Control 0 0 0 

30-day groups 
Cy2/30d 2.0 0 0 
Cy 1.5/30d 1.5 0 0 
Cy2/P 1/30d 2.0 1.0 0 
Cy2/P0.5/30d 2.0 0.5 0 
Cy1.5/P1/30d 1:5 1.0 0 
Cy2/P 1/X/30d 2.0 1.0 800 
Cy 1.5/P1/X/30d 1.5 1.0 800 








Fig 1. Rat laryngeal transplant model. Do- 
nor’s right common carotid artery (CC) is 
anastomosed to recipient’s left common 
carotid artery, while venous outflow is from 
donor’s left common carotid artery to re- 
cipient’s left external jugular vein (Ext. Jug 
v.). Sup. thyroid a. — superior thyroid ar- 
tery. 


fore transplantation, and duration of therapy after 
transplantation. 


METHODS 


Transplantation Technique. We used LBN-f strain 
rats as donors. The transplantation technique has been 
published. '4 Briefly, the larynx-pharynx-thyroid-tra- 
chea anlage was harvested en bloc, with the bilateral 
carotid arteries and superior thyroid arteries intact. 
The donor organ was transplanted into a pocket in 
the left side of the recipient Lewis rat’s neck in tan- 
dem with the native larynx (Fig 1). Using microvas- 
cular technique, the surgeon anastomosed the donor’s 
right common carotid artery end-to-side to the recip- 
ient’s left common carotid artery. The donor’s left 
common carotid artery was used as an arteriovenous 
shunt for venous drainage and was anastomosed end- 
to-side with the recipient’s left external jugular vein. 


Trial Design. Paired donor and recipient rats were 
randomly assigned to the 22 treatment groups (see 
Table). In pairs assigned to irradiation arms, the har- 
vested donor larynges were bathed in Wisconsin so- 
lution and treated with 800 cGy in 1 fraction for 7.34 
minutes with a 3.2 x 3.2-cm field size 40 cm from 
the source, a 300-kV Siemens orthovoltage unit with 
a Thoreus filter. The larynges were then immediately 
transplanted. 


Animals assigned to receive cyclosporine received 
intramuscular administration of either 1.5 or 2.0 mg/ 
kg per day throughout the postoperative period. The 
prednisone groups received prednisone by gavage at 
0.1, 0.25, 0.5, or 1.0 mg/kg per day. The animals were 
sacrificed at either 15 or 30 days after operation. The 
control animals did not receive any postoperative im- 
munosuppression and were painlessly euthanized at 
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Fig 2. Mean rejection grades with standard devia- 
tions. Doses for cyclosporine (Cy) and prednisone Cyclosporine Cyclosporine No 
(P) are in milligrams per kilogram per day. Irra- 2.0 mg 15mg Cyclosporine XRT 
diation (XRT; X) given to donor larynges in vitro. A 
A) Groups harvested at 15 days. B) Groups har- 
vested at 30 days. 
End Stage 
o Adv-Mod 
$ 
(0) 
5 Moderate 
© 
2 
Q 
£ Mild 


Normal 


15 days after transplantation. 


Larynges were harvested by sedating the animals 
with chloral hydrate, inspecting the transplanted lar- 
ynx in vivo for gross evidence of rejection, resecting 
the organ, and immediately sacrificing the animal by 
pentobarbital injection. The larynges were then pro- 
cessed for standard histologic whole-organ section- 
ing. 

End Points. A histologic grading system of rejec- 
tion has been published. !5 Briefly, gross features of 
encapsulation, secretion production, and arterial and 
venous blood flow were graded. Histologically, re- 
jection changes to the epithelium, minor salivary 
glands, thyroid gland, cartilage, perichondrium, and 
muscle and the extent of lymphocytic infiltration 
were recorded. The organs were graded by a single 
reviewer in a blinded fashion using the published 
scale, which is based on the progression of rejection 
that would be expected in an animal that received no 
immunosuppression: “normal” (0 to 3 days after 
transplantation), “mild” (4 to 5 days), “moderate” (6 





C 


i Alone P1.0 P0.5 Alone P1.0 Il y2P1X Cy1.5P1X i 


Cyclosporine Cyclosporine XRT 
2.0 mg 1.5 mg 


to 7 days), “advanced-moderate” (7 to 10 days), “end- 
stage” (10 to 14 days), or “infarcted” (anastomotic 
thrombosis). 


Statistical Analysis. The Kruskal-Wallis test was 
used to compare mean rejection grades in the 22 treat- 
ment groups. Because the groups were clearly not 
identical in their results, they were then compared 
pairwise by the Wilcoxon rank sum test to determine 
which groups could be clearly distinguished from 
others. To maintain an overall significance level of 
& < 0.05 for this set of pairwise comparisons, a Bon- 
ferroni correction of œ < 0.00024 was used for de- 
termination of statistical significance. The tests were 
2-tailed and performed on Statistical Analysis Soft- 
ware, version 6.12 (SAS Institute, Cary, North Caro- 
lina). 


RESULTS 


All 220 recipient rats survived the operative pro- 
cedure and their designated postoperative 15- or 30- 
day period. After harvest and histologic sectioning, 
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10 specimens were graded as infarcted because of 
hemorrhage into the laryngeal structures and occlu- 
sion of arterial and venous vessels. Three of the in- 
farcted specimens were in the group given the com- 
bination of irradiation, cyclosporine (1.5 mg/kg per 
day), and prednisone (1 mg/kg per day) for 30 days. 
The other 7 infarcted specimens were distributed 
among the other 20 groups, with no more than | in- 
farction in any single group. The remaining 210 of 
the 220 specimens (95%) were viable and able to be 
graded. 


Figure 2A is a graph of the mean histologic grade 
of rejection for each group at 15 days after transplan- 
tation. All groups with cyclosporine at 2 mg/kg per 
day, with or without prednisone at the various dos- 
ages, had equally low rejection grading within error. 
Cyclosporine at 2 mg/kg per day protected against 
rejection better than cyclosporine at 1.5 mg/kg per 
day (with or without prednisone), although most of 
the differences did not reach the Bonferroni correc- 
tion level of 0.00024. The groups with cyclosporine 
at 2 mg/kg per day had rejection grades uniformly 
lower than those of the prednisone-alone groups, with 
all differences reaching statistical significance at 
0.00024. Last, the groups with cyclosporine at 1.5 
mg/kg per day all had lower rejection grades than 
did the prednisone-alone groups, but again, the dif- 
ferences did not reach the statistical significance level 
of the Bonferroni correction. 


The two 15-day groups that received in vitro irra- 
diation to the larynx immediately before transplanta- 
tion exhibited intermediate rejection grades. When 
these groups were compared to the groups that re- 
ceived the same drug regimens without irradiation, 
no statistically significant effect was found. 


Figure 2B graphs the mean histologic grade of re- 
jection for each group at 30 days after transplanta- 
tion. The group given 2 mg/kg per day of cyclospo- 
rine plus | mg/kg per day of prednisone had the low- 
est rejection grade — significantly lower than the 
grade for the group given 2 mg/kg per day of cyclo- 
sporine alone. The group given 2 mg/kg per day of 
cyclosporine plus 0.5 mg/kg per day of prednisone 
experienced rejection of intermediate severity. When 
the cyclosporine was reduced to 1.5 mg/kg per day, 
the animals that received 1.0 mg/kg per day of predni- 
sone experienced less severe rejection than those that 
received cyclosporine alone, although the difference 
did not reach statistical significance. 


The results for the irradiated groups at 30-day har- 
vest were mixed. Irradiation significantly worsened 
results for the animals given cyclosporine at 2 mg/ 
kg per day plus prednisone at | mg/kg per day, but 
made no difference for those given cyclosporine at 


1.5 mg/kg per day plus prednisone at 1 mg/kg per 
day. 


DISCUSSION 


At 15 days after transplantation, adding steroids 
to a cyclosporine regimen does not significantly re- 
duce the severity of rejection. A regimen of steroids 
alone is inferior to cyclosporine alone or to combi- 
nation cyclosporine-steroid therapy. However, by 30 
days after transplantation, the rejection rates with cy- 
closporine alone rise significantly, and adding pred- 
nisone maintains protection. In vitro radiotherapy 
does not have beneficial effects when used in com- 
bination with cyclosporine and prednisone, in con- 
trast to previous work that demonstrated a benefi- 
cial effect when radiotherapy was used adjunctively 
with cyclosporine alone. !3 


These results support what we have observed clini- 
cally in our laryngeal transplant patients. Initially, we 
chose to maintain cyclosporine blood levels in the 
400- to 700-ng/mL range, which is standard therapy 
in post-transplantation immunosuppression. Yet, in 
combination with 30 mg/d of prednisone, we were 
able to decrease the cyclosporine to maintain blood 
levels in the 150- to 200-ng/mL range without ob- 
serving any signs of rejection. The prednisone dos- 
age was subsequently decreased to less than 10 mg/ 
d with further alterations in the immunosuppressive 
regimen. (The details of our post-transplant immu- 
nosuppressive regimen will be reported elsewhere.) 


Combining cyclosporine and oral steroid therapy 
has not been studied before in laryngeal transplanta- 
tion, but the combination has been investigated in 
the renal and cardiac transplantation literature, with 
results similar to ours. Starzl et al8-? demonstrated in 
66 patients that adding steroids allows cyclosporine 
dosages to be decreased, with a corresponding de- 
crease in the incidence and severity of nephrotoxic- 
ity, while maintaining the accepted 1-year actuarial 
graft survival rate. In cardiac transplantation, Griffith 
et al!0 demonstrated that cyclosporine used in com- 
bination with low-dose steroids substantially de- 
creased the frequency and severity of rejection epi- 
sodes, as well as reducing infectious complications 
and hospital costs and allowing earlier patient reha- 
bilitation. 


Previous work by Barthel et al!? with a rat model 
demonstrated that cyclosporine levels could be de- 
creased with the addition of in vitro irradiation while 
maintaining low rejection levels. Although the cause 
of this immunomodulatory effect was unclear, it was 
hypothesized that irradiation decreased the viability 
of “passenger lymphocytes” that accompany the or- 
gan during transplantation, initiating the rejection 
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cascade when recirculation begins. In vitro irradia- 
tion was evaluated in the current study only when 
given in combination with prednisone and cyclospo- 
rine. As irradiation did not demonstrate an additional 
beneficial effect when given in this manner, we must 
conclude that the proven immodulatory effects of 
both prednisone and irradiation individually are not 
additive when given in combination. 


Limitations of Study. Our findings would have been 
strengthened if we had measured cyclosporine blood 
levels in the rats. Previous work in our laboratory 
has demonstrated that intramuscular cyclosporine 
dosing levels correlate with serum levels in rats (M. 
Haug et al, unpublished observations). In addition, 
our analysis could have been affected by our choice 
to exclude the infarcted specimens from statistical 
analyses. Infarction may represent either a severe 
stage of rejection or simply an unfortunate chance 
event that occurs in a minority of specimens. Our 
5% infarction rate is far lower than the rates of 30% 
to 50% in the initial description of this animal mod- 
el.!5 If the 3 infarctions in the group given 1.5 mg/ 
kg per day of cyclosporine plus 1 mg/kg per day of 





prednisone for 30 days plus irradiation are excluded, 
only 7 infarctions occurred in the other 210 trans- 
plantations, and not more than 1 in any group; this 
finding suggests that they were chance occurrences 
with an insignificant effect upon the final results. 


CONCLUSION 


The ultimate goal in transplantation research is to 
induce acceptance of a foreign organ without com- 
promising other functions of the immune system or 
causing toxicities from powerful immunosuppressive 
medications. !® Once this goal is achieved, there will 
be a tremendous increase in the indications for trans- 
plantation of nonvital organs such as the larynx. Until 
then, decreased toxicities and improved rates of graft 
survival remain acceptable goals of transplantation 
research. Using the murine model, we have demon- 
strated that rejection rates in laryngeal transplanta- 
tion can be reduced with combined cyclosporine and 
steroid therapy. These results suggest that combina- 
tion drug therapy is a reasonable approach to estab- 
lishing a laryngeal transplant immunosuppressive 
regimen. 
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LACK OF ASSOCIATION BETWEEN GENE POLYMORPHISM OF 
TRANSPORTERS ASSOCIATED WITH ANTIGEN PROCESSING AND 
ALLERGIC RHINITIS IN A JAPANESE POPULATION 
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Gene products of TAP] and TAP2 (transporters associated with antigen processing) are involved in antigen presentation. It has 
been reported that polymorphisms in the TAP! gene are associated with atopy in Tunisians. We designed a case-control study to 
investigate the potential association of polymorphisms of the TAP1 gene with allergic rhinitis in a Japanese population. We used the 
amplification refractory mutation system polymerase chain reaction to characterize TAP1 gene polymorphisms in 60 unrelated Japa- 
nese patients with allergic rhinitis and 62 healthy control subjects. There was no significant difference in the distribution of TAP] 
alleles between the patient group and the control group. These findings suggest that the TAPI gene is not primarily involved in 


susceptibility to allergic rhinitis in the Japanese population. 


KEY WORDS — allergy, amplification refractory mutation system polymerase chain reaction, gene polymorphism, TAP1. 


INTRODUCTION 


Antigen peptides are presented to self-restricted 
cytotoxic T lymphocytes by the HLA class I mole- 
cules. These peptides are degraded from cytosolic 
proteins and assembled with HLA class I heavy chain 
and §2-microglobulin in the endoplasmic reticulum. 
The peptides are actively transported across the en- 
doplasmic reticulum membrane by the transporters 
associated with antigen processing (TAP).!4 


TAPI and TAP2 genes encode a heterodimeric 
molecule involved in endogenous antigen process- 
ing.!35 The class I antigen—antigenic peptide com- 
plex is then expressed on the cell surface and recog- 
nized by CD8* T cells. Previous work has shown 
that exogenous antigens can be channeled into the 
endogenous pathway presented by HLA class I mole- 
cules and be recognized by CD8* T lymphocytes. 
Thus, TAP gene products are thought to be among 
the factors dictating the nature of endogenous and 
exogenous antigen stimuli. 


Both TAP1 and TAP2 gene products are members 
of the adenosine triphosphate (ATP)—binding cassette 
superfamily. TAP1 and TAP2 molecules possess | 
hydrophobic domain and 1 ATP-binding domain. Of 
the previously reported TAP! polymorphisms,’ | (at 
position 333) is located within the hydrophobic do- 
main and | (at position 637) is within the ATP-bind- 
ing domain. A variation at either position may alter 
the specificity of the substrate and the function of 
the TAP protein. Therefore, the polymorphism of TAP 


genes could have functional consequences that may 
alter immune response regulation. 


Ismail et alë have provided evidence of a strong 
association between TAP 1 polymorphism and atopy 
in a Tunisian population. In this study, we examined 
TAPI polymorphism in normal control subjects and 
allergic rhinitis patients in a Japanese population in 
order to test whether their observation can be gener- 
alized. 

SUBJECTS AND METHODS 


Subjects. The gene and allele frequencies of the 
TAPI gene were determined in a group of 62 healthy 
subjects and a group of 60 unrelated patients with 
allergic rhinitis. The patients (32 men and 28 women) 
had a mean age (SD) of 28.2 + 13.2 years. The pa- 
tients visited our outpatient clinic of Mie University 
Hospital. The diagnosis of allergic rhinitis was based 
on symptoms of sneezing, watery rhinorrhea, and na- 
sal obstruction and the following 3 criteria: eosino- 
philia in nasal smears, a positive skin test for allergy 
to house dust mites, and a positive nasal provoca- 
tion test for allergy to house dust mites. All of the 
patients with rhinitis were positive on history for all 
of the symptoms and all 3 tests for allergy. The skin 
test was performed with house dust extracts (Torii 
Pharmaceuticals Co Ltd, Tokyo, Japan). 


Sixty-two unrelated normal control subjects (32 
men and 30 women) unrelated to the patients were 
carefully selected from the same district. All of the 
control subjects were negative for all of the symp- 
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Typing TAPI positions 333 and 637 by amplification re- 
fractory mutation system polymerase chain reaction, de- 
scribed by Ismail et al.8 Four oligonucleotides were in- 
cluded in each polymerase chain reaction mix. Two of 
these, 1 sense and | anti-sense, were designed such that 
their 3' terminal nucleotides were complementary to | of 
2 variants of TAP1 nucleotides, which encode amino ac- 
ids 333 and 637, respectively. Two other oligonucleotides, 
1 sense and 1 anti-sense, were complementary to flank- 
ing sequences located at asymmetric distances on either 
side of polymorphic nucleotides. As for position 333, am- 
plification resulted in constant control product of 533 bp, 
together with single product of 241 bp, if sample was 
homozygous for Ile-333; single product of 352 bp, if sam- 
ple was homozygous for Val-333; or both products, if 
sample was heterozygous. As for position 637, amplifi- 
cation resulted in constant control product of 429 bp, to- 
gether with single product of 307 bp, if sample was ho- 
mozygous for Asp-637; single product of 180 bp, if sam- 
ple was homozygous for Gly-637; or both products, if 
sample was heterozygous. Products of 6 amplified reac- 
tion mixes after electrophoresis in 2% agarose gel are 
shown. Lanes 1 and 5 — molecular weight markers; lane 
2 — position 333 heterozygote; lane 3 — Ile 333 homo- 
zygote; lanes 4 and 8 — negative controls; lane 6 — po- 
sition 637 heterozygote; lane 7 — Asp-637 homozygote. 


toms and the 3 parameters mentioned above. They 
had a mean age of 29.9 + 9.6 years, which is not 
significantly different from that of the patient group. 


DNA Extraction and Oligonucleotides. Genomic 
DNA was extracted from 100 uL of peripheral blood 
by a Sepagene (Sankojunyaku, Tokyo) kit. To ana- 
lyze a single polymorphic site of the TAP1 gene, we 
included 4 oligonucleotides in a single reaction mix. 
The nucleotide sequences of the primers are reported 
in the literature.’ 


Typing TAP Polymorphism. We used the ampli- 
fication refractory mutation system polymerase chain 
reaction technique to characterize TAP! gene poly- 
morphism in Japanese patients with allergic rhinitis. 
Genomic DNA samples (1 ug) were amplified in 100 
uL of reaction mixture containing 0.25 ug of each 
oligonucleotide primer (4 primers), 200 mol/L de- 
oxynucleotide triphosphates, 1x Taq DNA polyme- 
rase buffer, and 2 units of Taq DNA polymerase (Ta- 
kara, Ohtsu, Japan), overlaid with mineral oil. The 
reaction conditions used with the thermal cycler were 
as follows: 95°C for 5 minutes; 35 cycles of 94°C for 
1 minute, 58°C for 2 minutes, and 72°C for 2 min- 
utes; 72°C for 10 minutes. The reaction products were 


TABLE 1. FREQUENCY OF EACH OF FOUR POSSIBLE 
TAP1 ALLELES IN THREE POPULATIONS AS 
PROPORTION OF TOTAL NUMBER OF 














CHROMOSOMES ANALYZED = 
TAP! Alleles Frequency (% jv l 

Codon Japanese Tunisian’ E uropean? 

333. .637 (2n=124) (2n= 162) (2n = 138) 

A Ile Asp 82.3 83.5 848 — 
B Val Gly 9.6 9.5 13 
C Va Asp 0.8 2 2.2 
D Ile Gly 1:3 5 0 


2n — number of chromosomes in each group. 





separated on a 2% agarose gel and stained with eth- 
idium bromide. An ethidium bromide-stained gel is 
shown in the Figures to illustrate a typical TAP1 poly- 
morphism analysis. 


Statistical Analysis. The %? test with Yates’ cor- 
rection was used to test for a significant association 
between allergic rhinitis and TAP] alleles and geno- 
types. The statistical analysis was performed with 
Stat View software, version 4.5. 


RESULTS 


Frequency of TAP! Alleles in Japanese Popula- 
tion. The possible alleles of TAP1, based on known 
polymorphism, are shown in Table 1, together with 
the frequency at which each allele was observed in 
Japanese (control subjects of this study), Tunisian,® 
and European populations.’ 


The TAP] allele frequencies in the Japanese popu- 
lation were similar to those in the Tunisian popula- 
tion, but were different from those in the European 
population in that the TAP1-D allele was not observed 
in the Europeans. 


Association Between Allergic Rhinitis and TAP 1 
Polymorphism. The TAP1 gene and allele frequen- 
cies are shown in Table 2. No significant differences 
were observed in the TAP 1 gene and allele frequen- 
cies between control subjects and allergic rhinitis pa- 
tients. The TAP1 genotypes are shown in Table 3. 
No significant differences were observed in TAP] 
genotypes between the two groups. 


DISCUSSION 


Susceptibility to atopic diseases is known to be 
associated with genetic components, and several can- 
didate genes have been reported. Noguchi et al!” pro- 
vided evidence for linkage between asthma and gene 
markers in or near the interleukin-4 gene, the interleu- 
kin-9 gene, and D5S393 on chromosome 5q31-q33 
in a Japanese population. Moffatt and Cookson!! de- 
scribed an association between tumor necrosis fac- 
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TABLE 2. TAP1 GENE AND ALLELE FREQUENCIES IN 
CONTROL SUBJECTS AND ALLERGIC RHINITIS 


TABLE 3. DISTRIBUTION OF TAP1 GENOTYPES IN 
CONTROL SUBJECTS AND ALLERGIC RHINITIS 


























PATIENTS PATIENTS 
Control Allergic Rhinitis Allergic Rhinitis 
Codon (2n = 124) (2n = 120) Control (n = 62) (n = 60) 
333 637 n % n % p Genotype n % n % p 
a Val 13 10.5 11 9.2 NS a/a 1 1.6 1 1.7 NS 
b Ile 111 89.5 109 90.8 NS a/b 11 177 9 15.0 NS 
È Asp 103 83.1 104 86.7 NS c/d 19 30.6 12 20.2 NS 
d Gly 21 16.9 16 13.3 NS d/d l 1.6 2 33 NS 
A Ile Asp 102 82.3 102 85.0 NS A/A 42 67.7 44 36.7 NS 
B Val Gly | Ba 9.6 9 LS NS A/B ïi 77 7 117 NS 
D Ie Gly 9 73 7 58 NS A/D 7 113 5 83 NS 
i NS — not significant. B/B 0 0 l | 4 NS 
n B/C l 1.6 0 0 N 
tor phenotypes and asthma. Ismaïl et al8 reported a = 0 0 0 0 A 
strong association between TAP1 gene polymor- CIC 0 0 0 0 NS 
phism and 3 atopic diseases (allergic asthma, aller- C/D 0 0 0 0 NS 
ic rhinitis, atopi rmatitis) in Tunisians. 
gic rhinitis, atopic dermatitis) in Tunisians D/D j 16 | L7 NS 


We tried to elucidate an association between TAP 
gene polymorphism and allergic rhinitis in a Japa- 
nese population, following the protocol of Ismail et 
als exactly in performing the amplification refrac- 
tory mutation system polymerase chain reaction tech- 
nique. Although the sample size is small in this study, 
we could not find any association between them. The 
lack of association in the Japanese population prob- 
ably resulted from ethnic differences between Japa- 
nese and Tunisians. Kuwata et al!? could not find 
any differences in TAP allele frequencies between 
atopic dermatitis patients and control subjects in their 
study of a Japanese population; their finding sup- 
ports ours. Associations of TAP gene polymorphism 
and susceptibility to diseases are known for some 





other diseases, such as juvenile-onset psoriasis!3 and 
juvenile rheumatoid arthritis.'4 The mechanism of 
these associations is largely unknown. Because TAP 
polymorphism does not influence transport of pep- 
tide variants in mice or humans,!> a major contribu- 
tion of human TAP polymorphism to disease progres- 
sion and autoimmunity is not very likely. 


In conclusion, no association was observed be- 
tween TAP] gene polymorphism and allergic rhini- 
tis patients in a Japanese population. The relation- 
ship described by Ismail et al8 between the polymor- 
phism of this gene and the development of allergic 
diseases cannot be generalized. 
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INTRODUCTION 


Advances in and widespread use of prenatal ultra- 
sound have improved the perinatal management and 
survival of fetuses who have large cervical masses, 
especially in the presence of airway obstruction. Pre- 
natal magnetic resonance imaging (MRI) may be 
helpful in selected cases for precisely characterizing 
and evaluating the location of a mass and its relation- 
ship to the upper airway.!-3 We present a case of fe- 
tal cervical lymphangioma diagnosed by prenatal ul- 
trasound and MRI, and discuss the value of prenatal 
diagnosis to perinatal management. 


CASE REPORT 


A 27-year-old multiparous woman underwent rou- 
tine antenatal ultrasonography at 36 weeks’ gestation. 





Fig 1. Coronal HASTE (half-Fourier, single shot turbo 
spin echo) magnetic resonance image through neck of 
fetus at 36 weeks’ gestational age. There is large cystic 
mass in left side of neck, compatible with lymphangioma. 
Trachea (arrow) and hypopharynx (arrowhead), which 
are filled with fluid, are easily identified, and are not com- 
pressed by mass. 


The fetal sonogram showed a 10 x 7 x 6.5-cm cystic 
mass in the left side of the neck, entering the thorax 
and touching the trachea. The ultrasonographic find- 
ings were compatible with the diagnosis of a lymph- 
angioma. Ultrafast fetal MRI, performed to precisely 
delineate the relationship between the trachea and 
the mass, showed that the lymphangioma did not com- 
press the trachea, the larynx, or the hypopharynx (Fig 
1). It extended inside the posterior neck, toward the 
left shoulder (Fig 2). At 38 weeks of gestation, an 
elective cesarean section was performed. A multidis- 
ciplinary team including obstetricians, anesthesiolo- 
gists, intensive care pediatricians, and otolaryngolo- 
gists was present. After delivery, the child was placed 
on an adjacent table without interruption of the um- 
bilical cord. The placenta was left in situ (operation 
on placenta support [OOPS] procedure). A few sec- 
onds after delivery, the neonate began to breathe spon- 
taneously with no signs of airway obstruction. The 
umbilical cord was then clamped and the placenta 
was extracted. Upon examination, the mass was seen 





Fig 2. Sagittal HASTE magnetic resonance image shows 
extension of lymphangioma to nuchal area. 
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to cover all of the left side of the neck from the pinna 
superiorly to the clavicle and shoulder inferiorly and 
to the nape of the neck posteriorly. Cytologic exami- 
nation of fluid aspirated from the cyst confirmed the 
diagnosis of a lymphangioma. A computed tomogra- 
phy (CT) scan of the brain, neck, mediastinum, and 
abdomen did not reveal any other malformations. The 
child was discharged 4 days after delivery. 


DISCUSSION 


Fetal MRI is not a new tool. The first reports on 
its use are from the early 1980s.4 However, in those 
years, the length of time required for acquiring an 
image, along with fetal motion, led to poor-quality 
images. Ultrafast MRI, available today, allows fast 
image acquisition with no artifacts due to fetal mo- 
tion. In our case, the half-Fourier, single shot turbo 
spin echo (HASTE) technique was used. It enabled 
a very short image acquisition time of only 400 ms. 


As with other prenatal examination tools, the ques- 
tion of the safety of prenatal MRI has been posed. 
The few studies that have been performed have dem- 
onstrated that there is no danger to the fetus.>-7 There- 
fore, the MRI may be used relatively safely for prena- 
tal evaluation. 


Fetal cervical masses are associated with a high 
rate of perinatal mortality due to airway obstruction. 
The OOPS procedure has been described for allow- 
ing safe delivery in these cases.89 The neonate is de- 


livered by a cesarean section and is placed on a table 
adjacent to the mother without interrupting the um- 
bilical cord. Fetomaternal circulation is therefore 
maintained. The umbilical cord is clamped only af- 
ter the airway has been secured. Another technique 
that may also be used is the ex utero intrapartum treat- 
ment (EXIT) procedure.!9.1! After the cesarean sec- 
tion, only the head and one hand are exposed. The 
rest of the body is delivered after intubation or tra- 
cheotomy, and the umbilical cord is clamped. We de- 
cided to use the OOPS procedure, even though the 
prenatal MRI did not show signs of airway obstruc- 
tion, in order to allow the safest delivery. 


The precise rate of fetal cystic hygroma is un- 
known. However, an incidence as high as 1% of preg- 
nancies has been reported.!2 The cystic hygroma may 
be a part of a malformation syndrome. The most com- 
mon syndrome is the Turner syndrome, found in 50% 
to 70% of cases of fetal cystic hygroma.!3 In our case, 
the CT scan did not show other malformations. Kar- 
yotyping was not performed immediately after de- 
livery, and the child was not available for follow-up 
after discharge from the hospital. 


In conclusion, although ultrasound remains the pri- 
mary mode of fetal evaluation, ultrafast MRI may 
be useful in selected cases to precisely assess the na- 
ture and location of masses or malformations. The 
relationship between cervical masses and the airway 
is especially relevant for establishing a successful and 
safe planning of delivery. 
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INTRODUCTION 


A subset of craniofacial ossifying fibromas are 
characterized by a predilection for the sinonasal tract 
and orbit, with a tendency to affect younger age groups 
(first and second decades), potential for infiltrative 
growth, and a distinctive histologic profile, includ- 
ing psammomatoid ossicles.! The term aggressive 
psammomatoid ossifying fibroma (APOF) indicates 
the clinical behavior and histomorphological appear- 
ance of these lesions.' Clinicopathologic correlation, 





Fig 1. Computed tomography scan in coronal section with 
contrast enhancement. Expansile mass involves ethmoid 
sinus with anterior cranial fossa invasion, surrounded by 
well-demarcated bony shell. This mass expands lateral 
ethmoid wall, impinging on orbit. 
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particularly with the radiographic features, is essen- 
tial in establishing a specific diagnosis. ! 


CASE REPORT 
An 18-year-old man presented with sinusitis and 
nasal obstruction in 1992. He underwent an incom- 
plete remeval of a naso-ethmoid-sphenoid mass by 
an endonasal route. Six years later, he was admitted 





Fig 2. Tl-weighted magnetic resonance image in coro- 
nal section with gadolinium enhancement. Discrete right 
ethmoid mass, hypointense to muscle with outer hyper- 
intense rim. invades anterior cranial fossa. This mass de- 
forms right lateral ethmoid wall and impinges on right 
orbit. 
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Fig 3. Biopsy specimen shows multiple small round 
psammomatoid ossicles embedded in cellular fibrous 
stroma (H & E, original x100). 


to our department, referred by the other center, be- 
cause of a recurrence of a sinusal mass found on a 
routine examination. Physical examination showed 
a left eye amaurosis due to old trauma. He was ortho- 
phoric and had full ductions. Computed tomography 
(CT) revealed an expansile lesion involving the right 
ethmoid and frontal sinuses and the anterior cranial 
fossa. The lesion appeared heterogeneous and was 
surrounded by a hyperdense bony shell (Fig 1). Mag- 
netic resonance imaging (MRI) showed a 3.5 x 2.5 x 
4-cm cystic mass surrounded by a hyperintense solid 
component that was enhanced by gadolinium and an 
outer hypointense ring that correlated with bone. This 
mass occupied the ethmoid bone and frontal sinus 
and extended into the cranial cavity, pushing up the 
right frontal lobe (Fig 2). He underwent a frontal cra- 
niotomy and extradural tumor removal via the right 
subfrontal approach. During surgery, we found a yel- 
lowish mass covered by a bony shell that insufflated 
the right anterior cranial fossa and right frontal and 
ethmoid sinuses and impinged on the internal wall 
of the right orbit. The tumor was grossly removed 
macroscopically. The anterior cranial fossa was re- 
constructed with a frontal fascia flap and acrylic glue. 
Postoperative MRI showed a total removal of the 
mass. Histologic analysis showed a benign fibro-os- 





Fig 4. T1-weighted magnetic resonance image. Tumor is 
hypointense to muscle. 


seous proliferation composed of bony spicules and 
spherules admixed with fibrous stroma. The most 
relevant feature was the small mineralized or calci- 
fied (psammomatoid) bodies, admixed with cellular 
stroma, with a variable amount of myxomatous ma- 
terial (Fig 3). All of the above features gave us the 
diagnosis of APOF. The postoperative course of the 
patient was uneventful. After 2 years of follow-up, 
he appears disease-free on radiologic examinations. 


DISCUSSION 

The radiographic features of APOF are invaluable 
in diagnosis and in determining the extent of growth. ! 
On CT, the lesions are expansile and circumscribed 
by a thick shell of bone density! with a multiloculated 
(septations of bone density) internal appearance and 
a content of varying density.!~ On bone windows, 
abnormal bone of the tumor can be differentiated 
from surrounding bony structures.* Psammomatoid 
ossicles appear radiolucent or like ground glass.4 On 
MRI, APOF shows a cystic mass with an outer hypo- 
intense shell suggesting osteoid matrix calcification.” 
An APOF is hypointense to muscle (Fig 4) on T1- 
and T2-weighted sequences. The bony walls are iso- 
intense with gray matter on T1-weighted images and 
hypointense on T2-weighted images.* The enhance- 
ment of the bony walls with gadolinium suggests that 
they are not a reactive hyperostosis, but a tumor tis- 
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sue, ie, part of the proliferative process. 


Histopathologic examination shows small round 
ossicles to be the predominant form of proliferation 
of APOF. The ossicles vary in appearance and in num- 
ber, and they include the presence of osteocytes, in- 
dicating a lesion of osteogenic origin.! An APOF has 
lamellar bone trabecules with an osteoblastic rim and 
a cellular stroma with a fascicular storiform growth 
composed of round to polyhedral to spindle-shaped 
cells with prominent basophilic nuclei and inappar- 
ent cytoplasmic borders.! Cellular pleomorphism 
could be apparent, but anaplasia and necrosis are not 
identified.! An associated basophilic-appearing myx- 
omatous stroma is identified, admixed with the cel- 
lular spindle cell proliferation. Focally, cystic degen- 


eration of the myxoid stroma can be seen.! Giant cells 
are seen focally among the psammomatoid ossicles 
or scattered throughout the nonosseous stromal com- 
ponent.! 


Surgery is the treatment of choice.!-2 Complete 
surgical resection should be the goal of treatment, 
regardless of the technique used. Incomplete resec- 
tion of aggressive ossifying fibroma will likely lead 
to symptomatic recurrence and can cause significant 
morbidity. !:2 Complete surgical excision may not al- 
ways be feasible. In those cases, we recommend re- 
moval of as much as possible and regular radiologic 
follow-up, because adjuvant therapy such as irradia- 
tion has no proven efficacy in the treatment of APOF.! 
The prognosis in APOF is considered to be good.! 
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BOOK REVIEWS 


Color Atlas of the Anatomy and Pathology of the Epitym- 
panum 


Tauno Palva, in collaboration with Hans Ramsay and Clarinda 
Northrop. Hard cover, illustrated, indexed, 104 pages, 2001. 
Karger, New York, NY, $223.50. 


Purpose: To convey an understanding of the epitympanic 
compartments and their origin. During his career, Dr Palva was 
dissatisfied with the articles discussing anatomy of the epitym- 
panum. They did not reflect the facts. He has spent the past 10 
years, in retirement, investigating the complicated anatomy of 
this area. 


Contents: There are 3 sections: 1) Anatomy and Pathology of 
the Epitympanum, 2) Pathology Related to Amniotic Fluid Cel- 
lular Content and Superimposed Infection, and 3) Microsurgi- 
cal Approaches to Inflammatory Ear Disease. 


Attributes and Limitations: The thrust of this work is to aerate 
or alleviate disease in the epitympanum in those patients who 
have retraction pockets in the attic or Prussak’s space. Usually, 
cases in which multiple tubes have been needed have epitympa- 
num disease. Whereas the surgeon would traditionally have 
removed the diseased incus, malleolar head, and all attached 
folds, ligaments, and membranes in the canal wall-down proce- 
dure for cholesteatoma, the author portrays techniques to use 
with canal wall—intact procedures and the frontolateral atticot- 
omy approach in generalized cholesterol granuloma, mastoidi- 
tis, and epitympanitis. The author does not discuss reconstruc- 
tion techniques. It would be perplexing if some type of recon- 
struction was not done to the lateral wall after any extensive 
atticotomy. 


According to the author, the epitympanic work of the pio- 
neers, von Tréltsch, Prussak, Siebenmann, and others cannot be 
improved upon. Von Triltsch first described the “pouches of the 
membrana tympani” in 1856 and again in 1857, reporting that 
the anatomist Friedrich Arnold had earlier noticed the pouches, 
which he called “plica membranae mucosae anterior et poste- 
rior.” Von Tréltsch showed that these structures of the tympanic 
membrane were not folds, but rather, curved ridges protruding 
from the membrane at the short process of the malleus and con- 
tinuing forward and backward. In 1867, the Russian otologist, 
Prussak, described the superior pouch of the tympanic mem- 
brane (Prussak’s space), which has connection via the end por- 
tion of the posterior pouch. 


Applicability: The volume is more of a combined atlas (be- 
cause of the 171 figures, 166 in color) and text (because of the 
amount of literary work). It is elegantly done and supported by 
amedical career of clinical and laboratory research. It should be 
studied by all who wish to achieve surgical competency of ana- 
tomic knowledge in otology. 


DENNIS G. PAPPAS, MD 
Birmingham, Alabama 


Color Atlas of Oral Diseases in Children and Adolescents 


George Laskaris. Hard cover, illustrated, indexed, 338 pages, 
1997. Thieme, New York, NY, $129. 


Purpose: This book was written to help overcome the diag- 


nostic and treatment difficulties that arise in treating children ` 


with oral diseases. It is targeted at dentists, oral surgeons, family 
practitioners, pediatricians, and otolaryngologists. The author 
sought to organize the information on a uniform basis, with 
emphasis on brevity and clarity of the written summaries com- 
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bined with high-quality pictorial material. 


Contents: The book covers local, systemic, and neoplastic 
diseases involving the teeth and mouth. The Local Diseases sec- 
tion covers an extensive list of dental defects, along with devel- 
opmental anomalies. Sections are also devoted to mechanical 
and chemical burns, foreign body deposits, and periodontal dis- 
eases. A wide variety of infectious and noninfectious diseases of 
the mouth are likewise covered. The Systemic Diseases section 
covers genetic disorders, along with the oral manifestations of 
skin, autoimmune, gastrointestinal, hematologic, and metabolic 
diseases. Bacterial, viral, and human immunodeficiency virus— 
related oral conditions are also highlighted. The final section, on 
Tumors, covers both common and uncommon benign and ma- 
lignant lesions, as well as fibro-osseous and giant cell lesions. 
An extensive list of references is provided. 


Attributes: The book does an excellent job of giving a succinct 
description of the disease process and its treatment. Each entity 
has a brief discussion of definition of the disease, etiology, 
occurrence in children, clinical features, localization, differential 
diagnosis, and treatment. Pertinent aspects of the clinical fea- 
tures are elucidated in bullet list fashion, serving as a quick and 
functional resource in this area. Some conditions contain sec- 
tions on laboratory tests, radiography, and histopathology. High- 
quality photographs of every condition are included, along with 
radiographs and pathological sections where appropriate. Sources 
are referenced for further study where necessary. 


Limitations: The book is not meant to be a compendium; how- 
ever, it is very complete. Color photography frequently fails to 
adequately demonstrate the 3-dimensional aspects of some con- 
ditions. That notwithstanding, these photographs are definitely 
the best oral pathology photographs I have seen. 


Applicability: This is an excellent quick reference book for 
any medical or dental professional who deals with conditions of 
the mouth. Recognizing the patterns of pathology is crucial to 
accurate diagnosis, and this book provides excellent examples 
with outstanding photography that will aid significantly the di- 
agnostic ability of the reader. 


DONALD J. WITTICH, JR, MD 
Hoover, Alabama 


Head and Neck Surgery—Otolaryngology 


Byron J. Bailey, Karen H. Calhoun, Gerald B. Healy, Jonas T. 
Johnson, Robert K. Jackler, Harold C. Pillsbury TI, and Eu- 
gene M. Tardy, Jr, editors. Third edition. Hard cover, illu- 
strated, 2,559 pages in volume 1 and 1,196 pages in volume 
2, 2001. Lippincott Williams & Wilkins, Philadelphia, Pa, 
$335. 


Purpose: To create a comprehensive textbook dealing with 
the specialty of head and neck surgery—otolaryngology. 


Contents: All aspects of otolaryngology—head and neck sur- 
gery are included, along with the subject of general medicine, 
which includes genetics, molecular biology, and anesthesiol- 
ogy. 

Attributes and Limitations: Ten years have passed since the 
first edition of these 2 volumes, yet the editors have failed to 
make a clear analysis in the preface of the dramatic advances in 
most aspects of the specialty that, fortunately, are included in 
the text. The preface of the 2 volumes of 1993 is almost identical 
tothatofthe presentvolumes. Typically, one would like to know, 
up front, in such a volume(s) that there is now included a “new 
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frontier” section (imaging technology, alternative medicine, in- 
terpreting literature, and business of medicine); and that there is 
a virtual revolution in our concept of cochlear micromechanics, 
in that the third edition coincides with the cloning of a protein 
specific to the outer hair cells that likely is a motor protein. Other 
features could be mentioned. 


Even so, in keeping with the first editions of these volumes, 
the authors have maintained a scientific and academic atmo- 
sphere without losing its highly practical flavor. As in the pre- 
vious volumes, the tables and summary are highlighted, and this 
is a plus. There is up-to-date information on physical modalities 
—- for example, stereotactic radiosurgery of acoustic neuromas. 
The references are up-to-date. Although most illustrations are 
identical to those of the previous editions, they remain a great 
strength. Although black-and-white (some in color), they are 
well chosen and sharp, and many are quite extraordinary. 


. In essence, there is not of lot of revision between the editions 
(10 years); seemingly relevant innovations and new material is 
confined to the text part. Such material may not be part of the 
“highlighted” summaries, and much of the latter are in the same 
words as the old volumes. In the current volume, genetics is 
updated, and the amount of general genetic knowledge communi- 
cated is good, but too general. For example, there is no elabora- 
tion on the otolaryngological genetic findings and clinical ad- 
vances that have occurred over the previous 10 years. A refer- 
ence book of this magnitude should mention and elaborate on, 
for example, potassium channel defects associated with Jervell- 
Lange-Nielsen syndrome; the association of the gene responsi- 
ble for Pendred’s syndrome with an enlarged vestibular aque- 
ducti and so on. 

Applicability: These volumes were prepared to be of use to all 
otolaryngology—head and neck physicians. 


DENNIS G. PAPPAS, MD 
Birmingham, Alabama 


Atlas of Head and Neck Surgery—Otolaryngology 


Byron J. Bailey, Karen H. Calhoun, Norman R. Friedman, S. 
Newlands, and J. T. Vrabec, editors. Second edition. Hard 
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Contents: All aspects of head and neck surgery-otolaryngol- 


ogy are included, and the volume is nicely sectioned into ana- 
tomic sites of the specialty. This comprehensive atlas is a com- 
panion volume to the multivolume textbook and study guide 


Head and Neck Surgery—Otolaryngology. It is a second edition ` 


printing; the previous one was published approximately 10 
years ago. The prefaces of each book are similar, so what is 
different between the two editions cannot be deciphered. 
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tions occurred in approximately 3 of 380 subjects. Incidentally, 
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“Comprehensive” is the word. For head and neck surgery—oto- 
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cialties put into one. The only color added in the illustrations is 
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INTRODUCTION 


The Eighth Symposium on Cochlear Implants in 
Children was held March 1-3, 2001, in Los Angeles, 
California. This conference was attended by more 
than 700 otolaryngologists, scientists, audiologists, 
speech-language pathologists, educators of the deaf, 
and others involved with children who are deaf or 
hearing-impaired. The participants came from around 
the United States, as well as many other countries. 


Sponsored by the House Ear Institute and House 
Ear Clinic, the Symposium focused on emerging re- 
search and technology in the following diverse ar- 
eas: candidacy, surgery, speech perception and pro- 
duction, language and cognitive development, fitting 
and programming of implant devices, plasticity, and 
rehabilitation and education. The manuscripts in- 
cluded in this monograph represent a peer-reviewed 
sample of the many excellent papers delivered at the 
Symposium. 


The success of this Symposium was largely due 
to our special invited speakers, who provided cur- 
rent reviews and new perspectives in the areas of 
electrophysiology, plasticity, hearing aid fitting, and 


infant speech perception and production. This publi- 
cation is dedicated to the memory of one of those 
special invited speakers, Peter W. Jusczyk, PhD. 


Publication of this monograph has beenmade possi- 
ble by support from the National Institutes of Health/ 
National Institute on Deafness and Other Communi- 
cation Disorders (R13 DC04541-01), Advanced Bi- 
onics Corporation (Sylmar, California), Cochlear Cor- 
poration (Englewood, Colorado), and Med-El Corpo- 
ration (Durham, North Carolina). 


We express our sincere gratitude to those mem- 
bers of the House Ear Institute and House Ear Clinic 
who played an integral role in the success of this 
Symposium, as well as to Stephanie Pogorelsky, Amy 
Martinez, and Barbara Serrano for their assistance 
in preparing these published proceedings. We also 
thank Dr Brian F. McCabe and the editorial staff of 
the Annals of Otology, Rhinology & Laryngology for 
publishing this monograph. 

WILLIAM M. Luxrorp, MD 
LAURIE S. EISENBERG, PHD 
MARGARET WINTER, MS 
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MATURATION OF HUMAN AUDITORY CORTEX: IMPLICATIONS FOR 
SPEECH PERCEPTION 


JEAN K. MOORE, PHD 
LOS ANGELES, CALIFORNIA 


This project traced the maturation of the human auditory cortex from midgestation to young adulthood, using immunostaining of 
axonal neurofilaments to determine the time of onset of rapid conduction. The study identified 3 developmental periods, each char- 
acterized by maturation of a different axonal system. During the perinatal period (3rd trimester to 4th postnatal month), neurofilament 
expression occurs only in axons of the marginal layer. These axons drive the structural and functional development of cells in the 
deeper cortical layers, but do not relay external stimuli. In early childhood (6 months to 5 years), maturing thalamocortical afferents 
to the deeper cortical layers are the first source of input to the auditory cortex from lower levels of the auditory system. During later 
childhood (5 to 12 years), maturation of commissural and association axons in the superficial cortical layers allows communication 
between different subdivisions of the auditory cortex, thus forming a basis for more complex cortical processing of auditory stimuli. 


KEY WORDS — auditory cortex, development, neurofilament, thalamocortical axon. 


In choosing the time for intervention with a pros- 
thetic device in a hearing-impaired child, it should 


_ be useful to have some idea of the stage of matura- 


tion of the child’s auditory system. In particular, we 
would like to know something of the status of higher 
levels of the central auditory system in which acous- 
tic information is made relevant to behavior, includ- 
ing language. In the past decade, the Department of 
Neuroanatomy at the House Ear Institute has carried 
out studies of the structural development of the hu- 
man central auditory system. This paper briefly re- 
views what we have learned about the maturation of 
the human auditory cortex and the implications of 
these findings for the development of speech percep- 
tion. 


The developmental work in the neuroanatomical 
laboratory was carried out in human postmortem 
brain tissue collected at the University of Southern 
California—Los Angeles County Hospital during neu- 
ropathologic autopsy. Tissue was collected for an age 
series extending from the 16th gestational week to 
27 years, ie, through the fetal period, infancy, child- 
hood, and young adulthood. In our laboratory, the 
tissue samples were embedded in celloidin, sectioned 
on a microtome, and used for a variety of histologic 
and immunohistochemical procedures. From each tis- 
sue sample, one series of every eighth section was 
stained for cells by Nissl’s method, and the interven- 
ing sections were immunostained with antibodies that 
link to different structural elements of neurons and 
glial cells. 


One type of immunostaining that was particularly 
useful in tracing development was an antibody to ax- 





onal filaments. Neurons form their axons quite early 
in development, and the newly generated axons are 
extremely thin; they contain very little in the way of 
axoplasm or cytoskeletal (cell framework) elements. 
There are shifting ionic currents, but no rapid action 
potentials flow down the length of the immature ax- 
ons. When the time arrives for axons to begin to func- 
tion, they undergo a transformation in which they 
increase in size, create a framework of neurofilaments 
within their axoplasm, and induce surrounding glial 
cells to encircle the axons with a myelin sheath. Once 
this has occurred, the axons begin to transmit infor- 
mation by rapid conduction. Because neurofilament 
proliferation within an axon immediately precedes 
myelination and rapid conduction, immunolabeling 
of axonal neurofilaments is a useful marker of the 
onset of function in any neuronal system. 


The usefulness of neurofilament immunostaining 
as a marker of onset of function was illustrated in 
our previous studies of the development of the hu- 
man brain stem auditory pathway. Brain stem audi- 
tory axons develop an adult-like content of neurofila- 
ments between the 16th and 28th fetal weeks.! Mye- 
lination of the axons first occurs between the 26th 
and 28th weeks of development,” and by the 29th 
fetal week, the presence of click-evoked brain stem 
auditory potentials indicates that rapid synchronous 
conduction is occurring in brain stem pathways.3 
These findings indicate that the onset of information 
conduction through the brain stem occurs around the 
26th to 28th fetal weeks, at the beginning of the third 
trimester of development. A rapid increase in axonal 
conduction velocity and synaptic transmission time 
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Fig 1. Neurofilament-immunostained sections of corti- 
cal tissue. At 40th fetal week (fw) and at 4.5 months’ 
postnatal age, mature axons are present only in marginal 
layer. By 2 years of age, mature neurofilament-express- 
ing axons are entering deeper cortical layers. By 11 years, 
mature axons are present with adult-like density in all 
cortical layers. 


occurs in brain stem pathways during the perinatal 
period.*+ These findings demonstrate that the brain 
stem auditory pathway is relatively mature at the time 
of term birth and is functioning in a very adult man- 
ner in the first year of life. However, our studies of 
the cortex produced a very different picture of de- 
velopment, one that extends considerably beyond the 
perinatal period.> With neurofilament immunostain- 
ing, we observed 3 periods of cortical development, 
each characterized by the maturation of a different 
axonal system. 


PERINATAL PERIOD 


From the third trimester of gestation to the fourth 
postnatal month, mature axons are present only in 
the most superficial layer of the cortex, the marginal 
layer. This superficial band of neurofilament-express- 
ing axons is present at the 40th fetal week, ie, at the 
time of term birth, and is still prominent at 4.5 months 
after birth (Fig 1). The marginal layer, also called layer 
1, consists of axons that run parallel to the cortical 
surface for long distances, often as far as several mil- 
limeters. In their course, the axons contact large num- 
bers of apical dendritic tufts, the branched tips of up- 
wardly extending dendrites of neurons lying in the 
deeper layers of the cortex (Fig 2°). Through these 
apical dendrites, marginal layer axons drive the ac- 
tivity of the deeper cells and are believed to promote 
their structural and functional maturation.’ Although 
marginal layer axons undoubtedly have an impor- 
tant stimulatory role in the rapidly developing cor- 
tex of the late fetal period and first year of life, they 





Fig 2. Illustration of laminar organization of cortex. Nu- 
merals | to 6 indicate cortical layers. WM — deep white 
matter. Adapted from Ramon y Cajal.® 


constitute an intracortical system that carries little 
or no information on external auditory stimuli. 


Many studies attest to the auditory discrimination 
abilities of perinatal infants. Infants in the first months 
of life can accurately distinguish speech sounds that 
differ in a variety of acoustic characteristics*-!3 and 
vary across different speakers. !4-15 In addition to dis- 
tinguishing individual speech sounds, infants at 2 to 
4 months of age detect differences in the segmenta- 
tion of multisyllable phrases!® and changes in the 
word order of sentences.!7 Previously heard stimuli 
are retained for as long as 24 hours,!8 and stimulus 
retention is enchanced by syllable organization!’ and 
prosodic linkage.”° This precise processing of stimu- 
lus parameters in the first months of life is accom- 
plished by infants with a very mature cochlea and 
brain stem, but a cortex with mature axons only in 
layer |. Given that there is no route for transmission 
of auditory information to the cortex at this stage of 
development, it seems likely that the accurate stimu- 
lus discrimination evidenced by infants younger than 
5 to 6 months reflects the analytical abilities of the 
brain stem. 


EARLY CHILDHOOD 


Between 4.5 months and 1 year of age, neurofila- 
ment-expressing axons first appear in the white mat- 
ter core of the temporal lobe, with axons radiating 
up into the deeper layers 4, 5, and 6. By 2 years of 
age, a light plexus of vertical and horizontal axons is 
apparent in the deeper cortical layers (Fig 1), and 
this plexus becomes progressively denser by 3 to 5 
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years. The course and termination of these axons 
clearly identifies them as thalamocortical afferents, 
and thus as axons that carry input from the lower lev- 
els of the auditory system. The developmental course 
of this system of axons indicates that cortical pro- 
cessing of auditory information conveyed from the 
ear and brain stem begins between 4.5 months and 1 
year of age. The increasing axonal density up to at 
least 5 years of age implies that maturation of this 
system continues through the early childhood years. 


The idea that these deep axons relay auditory in- 
formation to the cortex is supported by the behav- 
ioral transition in speech sound discrimination that 
occurs in the second half of the first year of life. Al- 
though infants between birth and 4 months of age 
discriminate speech sounds equally weli in their en- 
vironmental language and one to which they are not 
exposed,?! at 6 months of age infants begin to differ- 
entially discriminate prototype and variant versions 
of native and non-native speech sounds.?? Between 
6 and 9 months of age, infants begin to listen signifi- 
cantly longer to monosyllables that have a high prob- 
ability of occurrence in their ambient language”? and 
to words that have the stress pattern of that lan- 
guage.?425 At the same time, there is reduced discrim- 
ination of sounds of a foreign language. American 
infants at 6 to 8 months respond to differences in 
Hindi speech sounds, but show little response to the 
same contrasts at 10 to 12 months.” Similarly, Japa- 
nese infants distinguish the consonants /r/ and /I/ at 
6 to 8 months of age, but not at 10 to 12 months.27 In 
another study, evoked responses showed a similar 
transition, in that between 6 and 12 months of age 
the amplitude of the mismatch negativity increased 
to contrasts in native (Finnish) phonemes, but de- 
creased to contrasts between non-native (Estonian) 
phonemes.”® It seems likely that the shift to a stron- 
ger focus on sounds of the environmental language 
that occurs in infants between 6 and 12 months of 
age is a reflection of maturation in thalamocortical 





afferents and an incipient participation of the deeper 
cortical layers in the process of auditory perception. 
Overall, the time of development of this deep ax- 
onal system, from 6 months to 5 years, coincides with 
the onset and development of perceptual language. 


LATE CHILDHOOD ] 


At 5 years of age, neurofilament expression is still 
confined to the deeper cortical layers. After 5 years 
of age, mature axons begin to appear in cortical lay- 
ers 2 and 3, and by 11 to 12 years of age their density 
is equivalent to that of young adults (Fig 1). These 
axons represent corticocortical connections, such as 
commissural axons, which interconnect the cerebral 
hemispheres and allow the two halves of the brain to 
work together. They also include association fibers, 
which interconnect different areas of cortex within 
the same hemisphere. These intrahemispheric and in- 
terhemispheric axons form the morphological basis 
for greater complexity in cortical processing of au- 
ditory input. 


Studies of auditory perceptual skills during late 
childhood support the notion of increasing complex- 
ity in cortical information processing. Studies of per- 
ception of sound in noise? and speech in noise30,31 
indicate that this ability improves steadily across late 
childhood and the teen years. In addition, the perfor- 
mance of children in perception of speech distorted 
by binaural switching, interruption, filtering, or spec- 
tral degradation improves markedly between 4 to 5 
years and 11 to 12 years.>2-34 Thus, the time course 
of maturation of perception of masked and degraded 
auditory stimuli, which makes greater demands on 
intracortical processing, matures in parallel with the 
superficial layers and the corticocortical connections 
they generate. These intrinsic cortical connections 
should mediate the improvement in perceptual skills 
that occurs between 5 and 12 years of age, and their 
development represents the final stage in the matura- 
tion of the human auditory cortex. 
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SOME CRITICAL DEVELOPMENTS IN ACQUIRING NATIVE 
LANGUAGE SOUND ORGANIZATION DURING THE FIRST YEAR 


PETER W. JUSCZYK, PHDt 
BALTIMORE, MARYLAND 


Although infants are born with rather sophisticated capacities for discriminating and categorizing speech sounds, they still must 
learn about the sound organization of their native language. Before 6 months, infants show relatively little sensitivity to native 
language versus non-native language sound organization. Shortly thereafter, infants recognize which sounds and sound sequences 
and rhythmic patterns are likely to appear in native language words. Knowledge of these features plays a critical role in how and 
when infants segment words from fluent speech. Word segmentation abilities develop rapidly between 7.5 and 10.5 months. Initially, 
English-learners segment words using stress cues. Soon after, they learn to use other potential cues to word boundaries. As their 
lexicons develop during the second year, they begin to use information about known words in segmenting and learning new words. 


KEY WORDS — native language, sound organization. 


Studies of infants’ speech perception capacities be- 
gan about 30 years ago. At that time, very little was 
known about the abilities of infants to distinguish 
one speech sound from another. There was much 
speculation about the possibility that infants might 
not discriminate speech sounds until after they had 
begun to produce’ speech. However, Eimas et al! 
found that even 1-month-old infants perceive subtle 
distinctions among speech sounds, such as the dif- 
ference between /ba/ and /pa/. Subsequent studies 
demonstrated that infants can perceive a wide range 
of phonetic contrasts,? even ones that do not occur 
in the native language that infants are learning.? An 
implication drawn from these findings is that even 
from birth, infants have the capacity to discriminate 
phonetic contrasts from any of the world’s languages. 


However, infants’ speech perception capacities are 
not limited to responding to differences among speech 
sounds. They also display some ability to adjust to 
the variability that occurs in the production of dif- 
ferent speech sounds. For example, because of dif- 


ferences in their vocal tracts and articulators, differ- . 


ent speakers’ productions of the same speech sounds 
differ from each other acoustically. Nevertheless, in- 
fants at 6 months display some ability to recognize 
the same syllable when produced by a range of dif- 
ferent speakers.* Similarly, changes in speaking rates 
affect the acoustic characteristics of speech sounds, 
yet 2- to 3-month-old infants show an ability to com- 
pensate for such differences.5 


The speech perception capacities documented for 
infants in the first half-year are general, in that they 


can be applied to the sound organization of any lan- 
guage. There is little evidence that these capacities 
have been influenced by experience with a particu- 
lar native language. However, although these early 
perceptual capacities provide infants with a foothold 
for acquiring language, infants still have much to 
learn about the sound organization of their native lan- 
guage. Languages differ in which elementary sounds 
they use in constructing words. Thus, the “th” sounds 
used in English words (the 2 sounds in “thin” and 
“that”) do not occur in French. Moreover, the par- 
ticular sequences of sounds that are permissible in 
words differ across languages. So, while Polish al- 
lows words to begin with consonant sequences like 
/db/ and /kt/, English does not. The rhythmic proper- 
ties of words in different languages also vary. En- 
glish more or less uses an alternating pattern in which 
stressed syllables alternate with unstressed ones (as 
in “alligator’’). In contrast, the syllables of Spanish 
words receive more nearly the same amount of stress. 
Infants must discover for themselves the sound or- 
ganization of their native language. 


DISCOVERING SOUND ORGANIZATION OF 
NATIVE LANGUAGE 

Infants learn much about the sound organization 
of their language during the second half of their first 
year. Between 6 and 9 months, infants discover which 
sounds and sequences of sounds are likely to be used 
in native language words. This ability was first dem- 
onstrated in a study® in which infants heard lists of 
unfamiliar words spoken in either Dutch or English. 
The rhythmic patterns of words in these two lan- 
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guages are very similar, but the languages differ in 
some of the sounds that they use (eg, English and 
Dutch /r/ sounds are different) and which sequences 
of sounds they permit (eg, sequences such as /zw/, 
/vl/, and /kn/ are pronounced in Dutch words, but 
not in English). When 6-month-old English-learners 
were tested on lists consisting of either Dutch or 
English words, they showed no consistent preference 
for English over Dutch words (ie, they listened about 
equally long to each). However, 9-month-old infants 
showed a very different pattern: 9-month-old Dutch 
infants listened significantly longer to Dutch lists, 
whereas English-learning 9-month-old infants lis- 
tened longer to English lists. Hence, by 9 months, 
infants appear to have learned which sounds and se- 
quences of sounds can appear in words in their na- 
tive language. 


During the same period, infants also are learning 
about the characteristic rhythmic patterns of native 
language words. For instance, the predominant stress 
pattern of 2-syllable words in English is a stressed 
syllable followed by an unstressed syllable, as in the 
words “doctor” and “candle.” Approximately 90% 
of the content words in English conversations begin 
with a stressed syllable.’ Although English has many 
2-syllable words beginning with an unstressed syl- 
lable followed by a stressed one (eg, “device,” “sur- 
prise”), these are much less frequent in conversational 
speech. In a study to determine when English-learn- 
ers develop sensitivity to this property of words,’ in- 
fants were tested on lists composed entirely of either 
stressed-unstressed words (such as “falter,” “donor,” 
etc) or unstressed-stressed words (such as “assign,” 
“comport,” etc). English-learning 6-month-old in- 
fants displayed no listening preference for either type 
of list, but 9-month-old infants showed a significant 
preference for the words with the stressed-unstressed 
pattern (ie, those with the pattern predominant in En- 
glish). Therefore, by 9 months, infants have learned 
about the typical rhythmic organization of words in 
their language. 


SEGMENTING WORDS FROM FLUENT SPEECH 


The fact that infants develop sensitivity to the char- 
acteristic sounds, sound sequences, and rhythmic pat- 
terns of native language words between 6 and 9 
months has interesting implications for other aspects 
of language acquisition. To acquire any language, one 
must necessarily learn its words. This would not be 
a serious problem if words were presented to infants 
one at a time. However, only about 9% of the speech 
that infants hear consists of isolated words.? Mostly, 
infants hear words in multiword utterances. The prob- 
lem is that in fluent speech, words are typically pro- 
duced without any pauses between them, so that it is 


hard to determine where one word ends and another 
begins. Although this fact is not always apparent in 
listening to one’s native language, it becomes evi- 
dent in listening to speech in an unfamiliar foreign 
language. Segmenting words is easier in our native 
language than in a foreign language, not because En- 
glish is produced more slowly or includes more pauses 
between words, but because we have learned to use 
certain cues to the boundaries of words in fluent 
speech. Languages differ considerably as to which 
cues are most reliable for identifying word bound- 
aries. Word segmentation cues are tied to the partic- 
ular sound organization of a language, so the cues 
learned for English will not necessarily help in a for- 
eign language. 

The information that English-learners have ac- 
quired about the sounds, sound sequences, and rhyth- 
mic properties of words by 9 months is useful in word 
segmentation. It is not surprising, then, that infants 
begin to segment words during this period. To test 
infants’ abilities to segment words, Jusczyk and As- 
lin! familiarized 7.5-month-old infants with repe- 
titions of a pair of isolated words, such as “cup” and 
“dog” (or “feet” and “bike”). After the infants had 
listened to each familiarization word for 30 seconds, 
they were tested on 4 different 6-sentence passages. 
Two passages included 1 of the familiarization words 
in each sentence; the other 2 included comparable oc- 
currences of 2 other words not heard during familiari- 
zation. That the infants listened significantly longer 
to the passages containing the familiarization words 
suggests that they recognized these words in fluent 
speech contexts. By comparison, 6-month-old infants 
tested with the same materials gave no evidence of 
recognizing the familiarization words in the passages. 
To determine whether the 7.5-month-old infants re- 
ally recognized the words or just recognized a sali- 
ent feature of the words, such as their vowels, we 
tested another group of infants. These infants were 
familiarized with items such as “tup” and “bawg” 
and were tested on the 4 original passages. There 
was no evidence that infants generalized from the 
familiarization items to the similar-sounding words 
in the passages. This finding suggests that the infants 
in the earlier experiment were actually segmenting 
the whole words, and not just responding to a salient 
property of them. 


Subsequent research has focused on what infants 
rely on in segmenting words. Indications are that En- 
glish-learners initially rely on the predominant stress 
patterns of words to locate the boundaries of words 
in fluent speech. If so, then infants should have an 
easier time segmenting words that begin with a 
stressed syllable than segmenting ones that begin 
with an unstressed syllable. To investigate this idea, 
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Fig 1. Average orientation times of 20 normal-hearing 
17-month-old children and one 18-month-old cochlear 
implant recipient to named (target) and unnamed (non- 
target) objects on test trials, according to whether taught 
word came from dense or sparse lexical neighborhood. 


Jusczyk et al!! familiarized 7.5-month-old infants 
with words with stressed initial syllables, such as 
“hamlet” and “kingdom,” and ones with unstressed 
initial syllables, such as “beret” and “device.” When 
tested on 6-sentence passages that either did or did 
not include the familiarized words, the infants were 
able to segment the words with stressed initial syl- 
lables, but not those with unstressed initial syllables. 
The authors concluded that English-learners begin 
segmenting words using stressed syllables as indi- 
cators of the onsets of new words in fluent speech. 
In fact, the tendency of English-learners to rely on 
stressed syllables to mark the onsets of words is so 
strong that they apply it even when listening to speech 


‘in a foreign language. Houston et al!? found that En- 


glish-learning 9-month-old infants were able to seg- 
ment Dutch words with stressed initial syllables from 
fluent Dutch speech. 


However, if infants relied only on the location of 
stressed syllables to mark the onsets of words, they 
would never segment words beginning with un- 
stressed syllables. What this initial strategy seems to 
do is not only to help infants segment words begin- 
ning with stressed syllables, but also to provide them 
with a means of learning about other types of segmen- 
tation cues. Breaking the input into smaller chunks 
beginning with stressed syllables allows infants to 
observe how segments and sequences of segments 
typically align with the beginnings and endings of 
words. In a language like English, sound sequences 
such as /kt/ and /db/ will never occur together at the 
beginning of these chunks. Thus, when such se- 
quences occur, each segment probably belongs to a 
different word. Similarly, languages have varied ways 
of pronouncing the same segment, so the /t/ at the 
beginning of words like “top” differs from the one 
that is produced at the end of a word like “pot.” By 
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Fig 2. Average orientation times of 20 normal-hearing 
17-month-old children and one 18-month-old cochlear 
implant recipient to appropriately named objects on test 
trials, according to whether taught word came from dense 
or sparse lexical neighborhood. 


attending to the positioning of such segments in the 
chunks provided by stress cues, infants can uncover 
another type of word segmentation cue. In fact, by 9 
months, English-learners can use cues relating to se- 
quences of segments, ]3 and:by 10.5 months, they can 
use information about the positioning of individual 
segments in recognizing words in fluent speech.!4 
Interestingly, it is at the latter age that English-learn- 
ers can segment words beginning with unstressed syl- 
Jables.}5 


HOW KNOWLEDGE OF WORDS AFFECTS 
‘ LEARNING OF NEW WORDS 
- Infants begin segmenting words by using informa- 
tion about the sound organization of the native lan- 
guage. Adults, too, use such information, but they 
also seem to use their lexical knowledge (ie, words 
stored in their lexicon) in segmenting words.}617 
When might infants begin to use such information 
in segmenting and learning words? We are explor- 


‘ing this issue with 17-month-old children.!8 Our ob- , 


jective has been to determine whether it is easier for 
infants to learn a new word that sounds like many 


- other words they know (ie, that comes from a dense 


lexical neighborhood), or one that sounds like only 
afew known words (ie, that comes from a sparse lex- © 
ical neighborhood). Lexical neighbors are defined as 
ones differing by a single phonetic segment from a 
particular word (eg, lexical neighbors of “cat” are 


“at,” “scat,” “pat,” “cut,” “cap,” etc). To ensure that 
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the 20 normal-hearing infants that we tested would 
have equal familiarity with the lexical neighbors and 
the words being taught in the experiment, we cre- 
ated our sets of lexical neighborhoods around the 
words to be taught, “terb” and “pawch.” Each infant 
was first exposed to a dense neighborhood (12 dif- 
ferent neighbors) of one of the target words and a 
sparse neighborhood (3 neighbors and 9 unrelated 
items) for the other. To ensure learning, infants heard 
each list of neighbors 6 times before training on the 
target words. Note that the target words did not ap- 
pear in these lists. 


After the familiarization period, the infants were 
taught the words by means of the split-screen prefer- 
ential looking paradigm.!* Four training trials (2 for 
each target word) occurred in which a picture ap- 
peared on | side of the screen, and the infants heard 
3 repetitions of the target word (ie, “terb” or “pawch”). 
A block of 4 test trials (2 for each target word) fol- 
lowed in which pictures of the 2 objects associated 
with each target word appeared side by side. Test 
trials began with a 1-second period in which the pic- 
tures were shown in silence, followed by a 4-second 
period in which 3 repetitions of | of the 2 target words 
occurred. The infants’ learning of the target words 
was indexed in 2 ways: 1) by their average overall 
time spent looking at the object corresponding to the 
target word during the 4-second test trials and 2) by 
their reaction time required to look at the correct ob- 
ject named by the target word. Two additional blocks 
of training and test trials followed the first set. 


In addition to the normal-hearing infants, we tested 
an 18-month-old infant who had received a cochlear 
implant from Dr John Niparko at 10 months of age. 
For purposes of comparison, this child’s data are pre- 


sented alongside those of the normal-hearing infants. 
Figure | shows the average times spent in orientation 
to the correct and incorrect objects for the target word 
from the dense neighborhood and the one from the 
sparse neighborhood. As is obvious from the Figure, 
the normal-hearing infants and the patient with the 
cochlear implant were only successful in learning the 
word from the sparse lexical neighborhood. Infants’ 
greater ability to learn a new word from a sparse, as 
opposed to a dense, neighborhood is also reflected 
in the reaction time data of the same infants, shown 
in Fig 2. In particular, the normal-hearing infants and 
the patient with the cochlear implant alike shifted 
their gaze significantly faster to the correct object 
named by the target from the sparse neighborhood 
than to the one from the dense neighborhood. Thus, 
infants seem to find it easier to learn a new word that 
sounds [ke relatively few words than to learn one 
that sounds like many words that they already know. 
Finally, although the data represent a single cochlear 
implant patient’s response on a single language task, 
it is striking that his behavior appears to mirror that 
of normal-hearing infants at about the same age. 


SUMMARY AND CONCLUSIONS 


In the “irst 6 months, infants display many impor- 
tant speech perception capacities that provide a foun- 
dation for learning language. However, only in the 
latter hal” of their first year do they give evidence of 
learning the sound properties of their native language. 
A critica achievement during this time is develop- 
ing the ability to segment words from fluent speech, 
which al ows them to begin to build a lexicon. As 
knowledge of particular words grows during the sec- 
ond year, this has further consequences for the learn- 
ing of new words. 
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MULTIPLE AUDITORY STEADY-STATE RESPONSES 


TERENCE W. PICTON, MD, PHD 
ANDREW DIMITRIJEVIC, MSC M. SASHA JOHN, PHD 
TORONTO, CANADA 


Steady-state responses are evoked potentials that maintain a stable frequency content over time. In the frequency domain, 
responses to rapidly presented stimuli show a spectrum with peaks at the rate of stimulation and its harmonics. Auditory steady-state 
responses can be reliably evoked by tones that have been amplitude-modulated at rates between 75 and 110 Hz. These responses 
show great promise for objective audiometry, because they can be readily recorded in infants and are unaffected by sleep. Responses 
to multiple tones presented simultaneously can be independently assessed if each tone is modulated at a different modulation fre- 
quency. This ability makes it possible to estimate thresholds at several audiometric frequencies in both ears at the same time. Because 
amplitude-modulated tones are not significantly distorted by free-field speakers or microphones, they can also be used to evaluate 
the performance of hearing aids. Responses to amplitude and frequency modulation may also become helpful in assessing suprathreshold 
auditory processes, such as those necessary for speech perception. 


KEY WORDS — amplitude modulation, auditory steady-state response, frequency modulation, objective audiometry. 


TIME AND FREQUENCY DOMAINS 


In order to understand the auditory steady-state re- 
sponses, it is helpful to move from the time domain 
to the frequency domain. This is achieved by means 
of the Fourier transformation. Any signal that changes 
over time can also be viewed as a “spectrum” of val- 
ues at different frequencies. The Fourier transforma- 
tion provides values for both amplitude and phase at 
each of the frequencies in the spectrum. A spectrum 
showing the amplitude or power (the square of the 
amplitude) of the signal at each frequency is easier 
to evaluate than the phase spectrum. Obtaining mul- 
tiple amplitude spectra over successive brief periods 
of time can give a “spectrogram,” which shows how 
the amplitude spectrum changes with time. These 
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transformations are familiar to audiologists in the 
context of acoustic signals (Fig 1). A continuous pure 
tone has a single-line spectrum that remains stable 
over time in the spectrogram. Speech sounds com- 
bine bands of frequencies (formants) that maintain 
relatively stable amplitudes during vowels and that 
change rapidly in amplitude, frequency, or both dur- 
ing consonants. In order to understand speech, a sub- 
ject must therefore be able to hear the frequencies 
that make up the speech signal and to discriminate 
rapid changes in frequency and amplitude. Acoustic 
noise covers a bandwidth of frequencies in the spec- 
trum and remains stable over time. 


Similar transformations can be applied to the elec- 
trical signals recorded from the human brain (Fig 2). 


Spectrogram 


Fig 1. Acoustic signals in time and frequency 
domains. Left column shows time waveforms 
for pure tone, speech sound, and white noise. 
Amplitude or intensity (1) of signal is plotted 
against time (T). In middle column, same sig- 
nals are presented as spectra, and intensity is 
plotted against frequency (F). In right column, 
spectra are followed as they change with time. 
Horizontal axis of middle column has become 
vertical axis of right column, and amplitude (or 
intensity) of signal at each frequency is now 
represented on simple 2-value scale. Time 
waveform and spectrum for speech sound are 
for vowel /a/. Spectrogram of speech sound 
includes transient stop-consonant /d/ occurring 
before vowel. General idea of this Figure de- 
rives from some hand-drawn diagrams by 
Charles Berlin. 
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Fig 2. Electroencephalogram (EEG) signals in 
time and frequency domains. This Figure is 
constructed similarly to Fig 1, but it uses EEG 
signals, rather than acoustic signals. Upper row 
of data shows 40-Hz steady-state response. 
Middle row shows transient response. First 2 
columns of middle row refer to auditory brain 
stem response (0 to 10 ms), whereas third col- 
umn follows sequence of transient evoked po- 
tentials up to 500 ms. As latency increases, re- 
sponses become slower in their frequency. 
Lower row shows EEG with identifiable spec- 
tral peaks at frequency of alpha rhythm (aster- 
isk) and other rhythms. 


Transient 


Noise 


Responses that are driven by a rapidly repeating stim- 
ulus have their main frequency components at the 
rate of stimulation and at its harmonics. These evoked 
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Fig 3. Auditory steady-state responses. Upper part of Fig- 
ure shows 40-Hz steady-state response, and lower half 
shows 100-Hz response. At top, left column shows po- 
tentials evoked by brief 500-Hz tone presented at rate of 
10/s and then at rate of 40/s. At bottom, responses are 
shown when tone is presented at rates of 10/s and 100/s. 
At rapid rates of stimulation, responses to consecutive 
stimuli are superimposed to give periodic response at fre- 
quency of stimulation. At right of Figure are shown re- 
sponses viewed in frequency domain. Data in time wave- 
forms have been submitted to Fourier transformation to 
obtain spectrum. Steady-state response shows up as a 
prominent peak at frequency of stimulation. Resolution 
of spectrum is reciprocal of time of sweep, and thus, there 
are relatively few points at which frequencies are mea- 
sured. For example, in lower spectra, resolution is 50 Hz 


(1/0.02), and there are frequency measurements at 0, 50, 
100, 150, and 200 Hz, etc. 
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potentials are described as “steady-state responses,” 
because their frequency components remain stable 
over time.! Responses to a brief sound or to an abrupt 
change in an auditory stimulus evoke a transient 
evoked potential that changes its frequency spectrum 
over time. The early components of the transient 
evoked potentials, which are composed of rapid fre- 
quencies, lead into the middle-latency and late evoked 
potentials, which are composed of slower frequen- 
cies. The background noise of the electroencephalo- 
gram (EEG) contains many frequencies. These gen- 
erally decrease in amplitude with increasing frequen- 
cy. Also, accentuated regions of the EEG spectrum 
indicate rhythms such as the alpha rhythm at frequen- 
cies near 10 Hz. 


AUDITORY STEADY-STATE REPONSES 


The most widely known auditory steady-state re- 
sponse is the 40-Hz evoked potential described by 
Galambos et al.? The middle-latency components of 
the transient auditory response recur at interpeak la- 
tencies of approximately 25 ms. When the stimulus 
rate is once every 25 ms (40 Hz), the response to one 
stimulus lines up with the responses to earlier stimuli, 
and the waves are superimposed to create a large pe- 
riodic response. The response is recognizable in the 
time domain as a repeating sinusoid and in the fre- 
quency domain as an isolated peak at 40 Hz (Fig 3, 
top). This response can be used to measure audiomet- 
ric thresholds in awake adults and older children; 
however, the response is difficult to measure in in- 
fants and young children. Furthermore, the 40-Hz 
response is very susceptible to changes in the level of 
arousal, showing small amplitudes during sleep*® and 
even smaller amplitudes during anesthesia.’ 


Although they are most prominent at frequencies 
of stimulation near 40 Hz, auditory steady-state re- 
sponses also occur at more rapid frequencies.*-? The 


18 Picton et al, Multiple Auditory Steady-State Responses 


Carrier of 1 kHz 
100% AM 
25% FM 
Modulation at 81 Hz 


Activation at 1 kHz 
Region of Basilar 
Membrane 





Cochlea 


responses at stimulus rates of between 75 and 110 
Hz probably represent the superimposed auditory 
brain stem responses (Fig 3, bottom). These responses 
have become important for clinical audiometry, be- 
cause they can track hearing thresholds reasonably 
well,!0-14 are unaffected by sleep,>-®-!5 and can be re- 
liably recorded in infants.'!:!6 


Sound 


Each auditory steady-state response is character- 
ized by an amplitude and phase. The spectrum usu- 
ally shows only the amplitude information. Phase can 
be represented by a polar plot, which shows the am- 
plitude as the length of a vector plotted with its ori- 
gin at zero and the phase as the angle between the 
vector and the x-axis. The responses can be distin- 
guished from noise by comparing the responses at 
the frequency of stimulation to the activity recorded 
at adjacent frequencies in the spectrum.!!:!7:!8 The 
recording computer can thus automatically recognize 
a significant response. Not only does the patient not 
have to respond subjectively, but the clinician does 
not have to make a subjective interpretation of the 
response as being present or absent. The evaluation 
of the response is as objective as its generation.!9 


Tones that are sinusoidally amplitude modulated 
have much narrower acoustic spectra than do the brief 
tone bursts used to evoke the transient auditory brain 
stem response”? and might therefore provide more 
frequency-specific measurements. Sinusoidally am- 


Fig 5. Multiple auditory steady-state responses 
(MASTER). If multiple auditory stimuli are 
presented together (left), they will each acti- 
vate region of basilar membrane specific to 
carrier frequency (middle). This activation will 
then evoke steady-state response at frequency 
of modulation. Frequency of modulation in- 
creases with increasing carrier frequency. 


Steady-State 
Response at 81 Hz 


Fig 4. Recording auditory steady-state response. 
Stimulus is constructed by synchronously modu- 
lating amplitude and frequency of continuous tone. 
This stimulus has an acoustic spectrum that is cen- 
tered near carrier frequency (bottom left). At low 
and moderate intensities, stimulus maximally ac- 
tivates basilar membrane in cochlea at region spe- 
cific to carrier frequency (center). Basilar mem- 
brane is represented with stapes at left and apex at 
right. Response is generated in brain stem and rec- 
ognized in general background EEG noise at 
modulation frequency (right). 


Brain 


plitude modulated tones contain energy only at the 
frequency of the carrier tone and at 2 sidebands, sepa- 
rated from the carrier by the frequency of modula- 
tion. Frequency modulation can be combined with 
amplitude modulation to give a larger response?! 
without significantly changing the narrow spectrum 
of the stimulus (Fig 4). 


MULTIPLE SIMULTANEOUS STIMULI 


Multiple tones (characterized by different carrier 
frequencies or ears of stimulation) can be presented 
simultaneously by the “MASTER” (Multiple Audi- 
tory STEady-state Responses) technique.?2-23 If each 
stimulus has its own signature modulation frequency, 
each response can be separately recognized in the re- 
corded EEG spectrum. If the responses do not inter- 
act, it is more efficient to record responses to multi- 
ple simultaneous stimuli than to record separate re- 
sponses to single stimuli.!>-2224 Four stimuli pre- 
sented simultaneously to 1 ear can evoke responses 
that are mediated through 4 different regions of the 
cochlea (Fig 5). At low or moderate intensities, the 
carrier frequencies determine the location on the basi- 
lar membrane that is maximally activated, and the 
modulation frequencies determine the location in the 
brain’s electrical spectrum in which the response can 
be recognized. The audiometric evaluation can be 
made even more efficient by testing both ears simul- 
taneously. A total of 8 stimuli (4 in each ear) can be 


Four stimuli presented 
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Cochlea Brain 


ay, 


Picton et al, Multiple Auditory Steady-State Responses 19 


MASTER Recordings 


30 lnthideiaialontiaielica wade aLbien nadine tlisind 


40 aahisikatsntihaliisah iledesslldaclinaabids nine mián hihi 





vY 
60! E E a EAEI SY 
Woy 

70 ZLU 

v Wy Yy y 50a] 
80 y 

dB HL 
70 80 90 100 110 dB HL 


Modulation Frequency (Hz) 


simultaneously presented and 8 separate responses 
detected. 


At high intensities, each stimulus will activate a 
broad region of the basilar membrane (with the acti- 
vation pattern spreading particularly toward the base). 
These regions of activation will be much broader than 
those shown in Figs 4 and 5. At these high intensities, 
the activation patterns for the different stimuli will 
overlap, and significant interactions may occur in the 
generation of the electrical responses (and in the per- 
ception of the sounds). For high-intensity sounds, one 
cannot therefore be sure of the exact location on the 
basilar membrane at which a response is initiated, al- 
though the presence of a response indicates that the 
auditory system is responding to the sound. 


EVALUATING AUDIOMETRIC THRESHOLDS 


The multiple auditory steady-state responses can 
be recognized down to intensities that are close to 


Thresholds 





© Behavioral 
= MASTER 


Fig 6. Threshold estimation using multiple steady-state 
responses. This Figure shows how well thresholds for 
auditory steady-state responses compare to behavioral 
thresholds in 86-year-old patient with long-standing 
sensorineural hearing loss. Steady-state responses are 
shown on left. Only a portion of each spectrum is plot- 
ted— from 70 to 110 Hz— so that individual responses 
can be easily recognized. Modulation frequencies in- 
creased with increasing carrier frequency (from 500 
to 4,000 Hz), and responses to left ear were interca- 
lated with those for right ears. Right ear responses 
considered as significantly different from noise are in- 
dicated with open triangles, and left ear responses are 
indicated with closed triangles. Figure illustrates prob- 
lem that can occur in using these responses to estimate 
thresholds. There is significant response to 500-Hz car- 
rier in left ear (modulation frequency of 80 Hz) at 50 
dB hearing level, but not at 60 dB. Response at 50 dB 
may be false alarm, or absence of response at 60 dB 
may be a miss (response present but not detected). At 
right, thresholds obtained behaviorally and by multiple 
auditory steady-state responses (MASTER) are plot- 
ted on audiograms. In audiogram for left ear, response 
to 500-Hz tone in left ear at 50 dB hearing level is 
accepted as real. 


hearing threshold. The thresholds for these physio- 
logical responses are usually between 10 and 20 dB 
above behavioral thresholds.!!-13.14 This is true for 
both hearing-impaired and normal-hearing subjects. 
Figure 6 shows the responses in 1 patient, and Fig 7 
compares the behavioral thresholds to the thresholds 
for the steady-state responses in 21 patients. The 
MASTER technique also works well with patients 
who use hearing aids.?5 The stimuli used to evoke 
the auditory steady-state responses can be amplified 
by free-field speakers or by hearing aids without much 
distortion. Aided thresholds can then be estimated to 
evaluate the adequacy of the aided hearing. 


FUTURE DIRECTIONS: EVALUATION OF 
SUPRATHRESHOLD HEARING 


Although the audiogram is a helpful tool for as- 
sessing hearing impairment, other types of informa- 
tion are also important. Measuring the suprathreshold 
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Fig 7. Threshold estimation with multiple steady- ee 
state responses.. Figure relates behavioral thresh- a 0 
olds obtained with conventional audiometry (y- 
axis) to physiological thresholds for auditory 
steady-state responses (x-axis) in 21 subjects (41 
ears), of be 5 had normal hearing and 16 had 4000 Hz 
sensorineural hearing loss. Some of these thresh- : . 2 
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discrimination of different sounds — through tests 
such as those that evaluate speech perception — is 
an essential part of every hearing assessment. This 
is how a patient’s deficit is determined and how ther- 
apy is monitored. l 


Auditory steady-state responses cannot directly 
evaluate speech perception. However, they can mea- 
sure the ability of the cochlea and brain stem to dis- 
tinguish small changes in frequency and intensity, 
thereby providing the cortex with the basic informa- 
tion for interpreting sounds as speech. If the same 
carrier frequency is independently modulated in am- 
plitude and in frequency with each modulation occur- 
ring at a different rate, 2 separate responses can be 
obtained.?6 If we combine 50% amplitude modula- 
tion and 25% frequency modulation at each of the 
audiometric frequencies 500, 1,000, 2,000, and 4,000 
Hz, we can then evoke 8 different responses. The 
number of responses recognized varies with how well 
the subject perceives speech when the speech sounds 
are increased in intensity from threshold. This ap- 
proach needs to be more extensively evaluated. How- 
ever, it might lead to an objective evaluation of how 
the cochlea processes the information needed for per- 
ceiving speech. 


Another important attribute of suprathreshold hear- 
ing is the-ability to make discriminations of frequency 
and inteasity when they are occurring very rapidly. 
Psychopaysical assessments of the ability of the au- 
ditory system to follow changes in amplitude or fre- 
quency at different rates provide “temporal modula- 
tion transfer functions.”2’ Another approach to assess- 
ing how well a subject can recognize brief changes 
in a sound is to measure “gap detection.”2° An in- 
ability tc follow rapid changes in a sound occurs in 
many auditory disorders. Disorders of temporal pro- 
cessing ere particularly evident in patients with au- 
ditory neuropathy.2? The auditory steady-state re- 
sponses zan be recorded in normal subjects at high 
modulation frequencies.®-9.30 Determining how well 
the respcnses follow at rapid rates might provide an 
objective assessment of the auditory system’s abil- 
ity to make rapid temporal resolutions. 


SUMMARY 


Multiple auditory steady-state responses are eas- 
ily and objectively measured in the frequency do- 
main. These responses are very useful for measur- 
ing auditory thresholds and may become helpful for 
evaluatirg the ability of the auditory system to dis- 
criminate sounds at suprathreshold levels. 
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ASSESSMENT OF HEARING IN INFANTS WITH MODERATE TO 
PROFOUND IMPAIRMENT: THE MELBOURNE EXPERIENCE WITH 
AUDITORY STEADY-STATE EVOKED POTENTIAL TESTING 


GARY RANCE, MSC ROBERT J. S. BRIGGS, MBBS, FRACS 
MELBOURNE, AUSTRALIA 


Auditory steady-state evoked potential (ASSEP) testing has been used in the assessment of hearing in infants at the University of 
Melbourne School of Audiology Clinic during the past 10 years. This study examines the evoked potential and behavioral hearing 
test findings for 200 children in whom permanent, moderate to profound hearing loss was diagnosed with the ASSEP procedure. The 
ASSEP audiograms were obtained with amplitude and frequency modulated tones at octave frequencies (500 to 4,000 Hz). In 184 
infants, the hearing loss appeared to be of sensorineural origin. For these children, there was a strong correlation between ASSEP 
findings and behavioral thresholds, with Pearson r coefficients ranging from 0.31 to 0.93 with test frequency. As such, the ASSEP 
findings obtained in the first months of life were accurate enough to form a basis for fitting amplification and early intervention. For 
the 16 children whose hearing loss appeared to be the result of auditory neuropathy, however, ASSEP thresholds showed little or no 
relationship to the behavioral audiogram. 


KEY WORDS — evoked potential, hearing, infant. 


INTRODUCTION The applicability of auditory steady-state evoked 

The link between congenital hearing loss and long- potentials (ASSEPs) for evaluation of hearing level 
term speech and language difficulties is now well in infants and young children has been investigated 
established. The provision of amplification and edu- over the past decade. ASSEPs are continuous scalp- 
cational and family support, however, has been shown recorded potentials that arise in response to regularly 
to significantly reduce the damaging effects of audi- varying stimuli such as sinusoidal amplitude modu- 
tory deprivation, particularly if the intervention takes lated and frequency modulated (AM/FM) tones.° The 
place within the first 6 months of life.! Identifying response is periodic at the modulation frequency and 
affected children within this critical period is an im- can be extracted mathematically from the electroen- 
portant objective for hearing professionals. To this cephalogram by means of Fourier analysis.’ ASSEPs 
end, neonatal and infant screening programs using are suitable for the testing of infants in that they are 
test procedures such as otoacoustic emissions and relatively unaffected by subject state (natural sleep 
auditory brain stem responses (ABRs) have been es- or sedation) when modulation rates in excess of 70 
tablished in clinics around the world. Hz are used,*”* and they are reliably present in chil- 
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fail these screening assessments remains a signifi- The ASSEP testing using continuous modulated 
cant challenge. Appropriate recommendations re- tones offers a number of advantages over assessment 
garding the selection and fitting of devices such as techniques that require short-duration stimuli (such 
hearing aids and cochlear implants can only be made as tone bursts) to produce synchronized neural ac- 
with an understanding of both the degree and config- tivity. One important feature of the continuous modu- 
uration of a child’s hearing loss. Conditioned audio- lated tone is its frequency specificity. Amplitude mod- 
metric test techniques (such as visual reinforcement ulated tomes, for example, contain energy that is re- 
audiometry), which can provide increasingly accu- stricted to the carrier frequency and 2 sidebands sepa- 
rate information in older children (>6 months), are, rated from the carrier by the modulation frequency. 
unfortunately, not suitable for quantifying hearing loss The acoustic spectra of combined AM/FM tones are 
in very young infants or those with visual or devel- more complex. However, provided that the width of 
opmental disabilities. As such, there has been con- frequency modulation is restricted to 10% of the car- 
siderable interest in the development of hearing as- rier or less, the signal energy is contained within a 
sessment techniques that can “objectively” estimate critical band and is narrower than repeated tone bursts 
the audiogram, and thus serve to confirm and extend at comparable presentation rates.” This specificity, 
findings from behavioral assessment. and the fact that ASSEPs can be elicited by carrier 
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Fig 1. Risk factors for children referred through Victo- 
rian Infant Hearing Screening Program. Gray bars — 
subjects with sensorineural hearing loss; black bars — 
subjects with auditory neuropathy. 


tones across the audiometric range, allows the gener- 
ation of evoked potential audiograms, which, in hear- 
ing-impaired subjects, can reflect the pattern of the 
hearing loss.458 


Another benefit afforded by the tonal stimuli used 
in ASSEP testing relates to presentation range. Be- 
cause the tones are continuous, the calibration cor- 
rections (accounting for differences in temporal sum- 
mation for brief versus long-duration stimuli) that 
limit the maximum levels of tone bursts to around 
100 to 105 dB nHL,? are not required. Modulated 
tones can therefore be presented at levels as high as 
120 dB HL, and as was demonstrated in our previ- 
ous studies,*:!9 quantification of hearing loss in the 
profound range is possible. 


ASSEP testing has been used clinically at the Uni- 
versity of Melbourne School of Audiology Clinic 
since 1990 for hearing loss diagnosis in infants and 
for objective confirmation of hearing levels in young 
cochlear implant candidates. This retrospective study 
was aimed at assessing the reliability of ASSEP test- 
ing in very young children in whom immediate con- 
ditioned behavioral testing was not possible, by com- 
paring the evoked potential thresholds measured ini- 
tially with subsequent behavioral hearing levels. 


MATERIALS AND METHODS 


Findings from 200 infants and young children 1 
to 8 months of age at the time of the ASSEP evalua- 
tion (mean age, 3 months) are presented in this study. 
The subjects were all referred for ASSEP testing af- 
ter failure on a click-evoked ABR screening assess- 
ment (ABR threshold of >40 dB nHL in both ears). 


Subjects were included in the study if they showed 
“elevated” ASSEP thresholds (260 dB HL) at any 
test frequency, and no evidence of middle ear dis- 


ease (normal multiple probe tone tympanometric re- 
sults and normal otoscopy). The 60-dB HL cutoff 
level was selected because Rickards et al? found that 
more than 97% of normal neonates have ASSEPs 
for stimuli presented at 55 dB HL. 


The hearing loss in 184 of the children appeared 
to be of sensorineural origin, as established at the 
time of the diagnosis by the absence of preneural re- 
sponses (otoacoustic emissions or cochlear micro- 
phonics) or the presence of a repeatable ABR at ele- 
vated levels. The remaining 16 children appeared to 
be suffering from a neural transmission disorder simi- 
lar to that described by Starr et al!! as “auditory neu- 
ropathy.” Each of these children had absent ABRs at 
maximum presentation levels, but had present outer 
hair cell responses such as otoacoustic emissions and/ 
or cochlear microphonics. 


As mentioned above, the subjects had all been eval- 
uated as part of an ABR screening program. This pro- 
gram was restricted to children whose histories placed 
them “at risk” for hearing impairment, and as such, 
all of the subjects in this study had at least 1 risk fac- 
tor. The risk criteria for which the children were ini- 
tially referred are summarized in Fig |. The most 
common risk factors for the subjects with sensorineu- 
ral hearing loss were prematurity with hypoxia and 
a family history of hearing loss. Among the children 
with findings consistent with auditory neuropathy, 
neonatal hyperbilirubinemia was the most common 
risk factor, being present in 8 of 16 cases. This inci- 
dence is consistent with a number of reports that have 
cited neonatal jaundice as a factor in pediatric audi- 
tory neuropathy. !%-13 


ASSEP testing was performed in a sound-treated 
room with the child in natural sleep. The electroen- 
cephalographic activity was monitored with silver— 
silver chloride disk electrodes or disposable Niko- 
med neonatal electrodes. Differential recordings were 
made between the vertex or high forehead (nonin- 
verting input) and the ipsilateral mastoid or earlobe 
(inverting input). A third electrode on the contralat- 
eral earlobe or mastoid served as a ground. The inter- 
electrode impedance was in all cases less than 10 
kQ at 260 Hz. ASSEP testing was carried out with a 
custom-built evoked potential system that used an 
IBM-compatible XT-type microcomputer to gener- 
ate stimuli and analyze responses in the manner de- 
scribed by Cohen et al.” The raw electroencephalo- 
gram was passed through a preamplifier and band- 
pass-filtered (0.2 to 10 kHz). The signal was then 
Fourier analyzed at the stimulus modulation frequen- 
cy (90 Hz) with analog multiplication followed by 
low-pass filtering to extract response amplitude and 
response phase information. The presence or absence 
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Fig 2. Distribution of auditory steady-state evoked potential (ASSEP)—behavioral threshold comparisons for ears with sensori- 
neural hearing loss (SN; unfilled squares) and ears with auditory neuropathy (AN; filled squares). Multiple data points at 
particular coordinate are represented by single icon. Dashed lines — 1:1 ASSEP-behavioral correspondence; solid lines — 
regression lines for sensorineural data. Regression formula for each frequency is shown. Total number of ears varied with test 
frequency. A) 500 Hz (SN, n= 160; AN, n = 18). B) 1 kHz (SN, n = 232; AN, n = 32). C) 2 kHz (SN, n= 125; AN, n = 7). D) 


4 kHz (SN, n= 131; AN, n = 32). 


of the response was then determined automatically 
with a statistical detection criterion based on non- 
random phase behavior (phase coherence). The par- 
ticulars of this analysis are described in detail by Co- 
hen et al. 


The test stimuli were 500-Hz, 1-kHz, 2-kHz, and 
4-kHz tones that were amplitude and frequency mod- 
ulated at a rate of 90 Hz. This rate was used to avoid 
the problems associated with ASSEP testing in sleep- 
ing subjects at lower (<70 Hz) modulation frequen- 
cies. An amplitude modulation depth of 100% and a 
frequency modulation width of 10% were combined 
to maximize the response amplitude. 


A pure tone was used to calibrate the stimulus lev- 
els. The introduction of frequency modulation did 
not alter the peak-to-peak amplitude, and the introduc- 
tion of amplitude modulation resulted in a reduction 
of only 1.76 dB.’ Behavioral measurements (show- 
ing threshold changes of <2 dB with the introduc- 
tion of amplitude modulation) in 10 adult subjects 
with normal hearing confirmed this observation. 


ASSEP thresholds were obtained by increasing the 


presentation level in 10-dB steps from a starting point 
around 60 dB HL until a statistically significant, 
phase-locked response could be obtained. The stimu- 
lus level was then decreased in 5-dB steps until the 
distribution of phase angles became random. The 
threshold was defined as the minimum level at which 
the phase coherence was statistically significant at 
the p < .03 level. Thresholds were determined in this 
way for each of the subjects by use of stimuli pre- 
sented monaurally at a range of carrier frequencies. 
In most cases, thresholds could be obtained at the 4 
test frequencies in both ears within a 45- to 60-minute 
test period (20 to 90 seconds per presentation level). 
Clinical constraints did, however, mean that not all 
of the test frequencies could be assessed for every 
child. 

Behavioral audiograms were obtained for each ear 
(monaural stimulation) under headphones or tube- 
phones for all subjects by means of conditioned au- 
diometric techniques (visual response audiometry). 
Threshold levels were determined by a 10-dB-down, 
5-dB-up search procedure. The test stimuli were war- 
ble tones at octave frequencies from 500 Hz to 4 kHz. 


m-f 
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TABLE 1. PEARSON PRODUCT MOMENT 
CORRELATION COEFFICIENT VALUES COMPARING 
ASSEP AND BEHAVIORAL THRESHOLDS 


TABLE 2. ASSEP-BEHAVIORAL THRESHOLD 
DIFFERENCE LEVELS FOR SUBJECTS WITH 
SENSORINEURAL HEARING LOSS 














Frequency (Hz) 
Group 500 1,000 2,000 4,000 
SN 0.86 0.81 0.93 0.89 
AN 0.52 0.47 0.67 0.48 


Ears with either auditory steady-state evoked potentials (ASSEP) or 
behavioral responses absent at maximum levels were not included 
in analysis. Numbers of data points varied with frequency and were 
as follows for sensorineural (SN) group: 500 Hz, n= 160; 1 kHz, n= 
232; 2 kHz, n = 125; 4 kHz, n = 131. For auditory neuropathy (AN) 
group, they were 500 Hz, n = 18; 1 kHz, n= 32; 2 kHz, n = 7; 4kHz, 
n= 32. 


“60 to 75 


ASSEP Level Frequency (Hz) 
(dB HL) 500 1,000 2,000 4,000 








9.6413.9 9811.1 58+15.7 4.04168 
80 to 95 414121 3.6482 3.3456 5845.5 
100+ 5.5477 5.0463 3347.1 15477 


Data are mean + SD. Results for each carrier frequency are divided 
into 3 categories based on ASSEP threshold level. Ears with either 
ASSEP or behavioral responses absent at maximum levels were not 
included in analysis. Numbers of data points varied with frequency 
and were as follows: 500 Hz, n = 160;1 kHz, n = 232; 2 kHz, n = 
125; 4 kHz, n= 131. 





The maximum presentation level was 120 dB HL for 
all frequencies. The age at which the subjects were 
mature enough to reliably complete this assessment 
ranged from 7 to 14 months (mean, 9 months). As 
such, there was usually a delay of approximately 6 
months between the time of diagnosis (by ASSEP 
testing) and behavioral confirmation of hearing lev- 


els. 
RESULTS 


Comparisons of ASSEP and Behavioral Thresh- 
olds. Figure 2 shows the distribution of ASSEP-be- 
havioral threshold comparisons for all of the sub- 


- jects. We obtained 727 comparisons from the 200 


children (400 ears): 178 at 500 Hz, 264 at 1 kHz, 
132 at 2 kHz, and 153 at 4 kHz. These findings rep- 
resent the occasions on which both an ASSEP and a 
behavioral threshold were established. Instances in 
which either the behavioral threshold or the ASSEP 
threshold was absent at maximum presentation lev- 
els were not included in the correlation analyses and 
have been considered separately. Thirteen of the 400 
ears (3%) showed no response at any test frequency 
on either ASSEP or behavioral threshold testing. 
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Results for Children With Sensorineural Loss. 
Overall, the findings for children with sensorineural 
hearing loss showed a strong correlation between the 
ASSEP threshold and the behavioral hearing level. 
Pearson r correlation coefficient values ranged from 
0.81 to 0.93 across the test frequencies (Table 1). As 
can be seen in Fig 2, the ASSEP threshold was typi- 
cally obtained at levels equal to or slightly above the 
behavioral hearing level. Only 6 of the 368 ears with 
sensorineural hearing loss had ASSEP thresholds that 
underestimated the subsequent behavioral thresholds’ 
by 10 dB or more. In each of these cases, there was a 
deterioration in hearing during the period between the 
initial ASSEP assessment and the behavioral testing. 
This deterioration was confirmed by repeat ASSEP 
evaluation. Results for 1 such child with a progres- 
sive hearing loss associated with Pendred’s syndrome 
are presented in Fig 3. 

Difference values were calculated for each ASSEP- 
behavioral threshold comparison (excluding those 
with confirmed deteriorating hearing loss) by sub- 
tracting the behavioral level from the ASSEP level. 
Table 2 shows the mean and SD of these difference 
Frequency (Hz) 
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Fig 3. ASSEP and behavioral audiograms (right ear) for child with progressive sensorineural hearing loss (Pendred’s syn- 
drome). A) ASSEP thresholds obtained at 2 months of age. B) First reliable behavioral test results and repeat ASSEP findings 
obtained at 8 months of age. C) ASSEP and behavioral threshold findings obtained after minor head injury at 14 months of 


age. 
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Fig 4. Behavioral hearing thresholds in ears with absent 
ASSEPs to stimuli at maximum presentation level (120 
dB HL). 


levels for each carrier frequency. Results for each 
frequency were divided into 3 categories based on 
ASSEP threshold level. As with previous studies,*:!0 
there was a tendency toward greater difference lev- 
els for lower-frequency stimuli and for lesser degrees 
of hearing loss. A one-way analysis of variance, how- 
ever, comparing ASSEP-behavioral threshold differ- 
ence values showed no significant effects for ASSEP 
threshold category or for carrier frequency. 


Results for Sensorineural Subjects With Absent 
ASSEP at Maximum Presentation Levels. Figure 4 
shows the behavioral hearing threshold levels for ears 
that showed no ASSEP to tones presented at 120 dB 
HL. The ASSEP absence in every case was consis- 
tent with profound hearing loss. Behavioral thresh- 
olds obtained on the 474 occasions in which there 
was no ASSEP were in all cases found at levels of 
2100 dB HL, and in most cases (93.4%), they were 
at levels of 2115 dB HL. 


Results for Children With Auditory Neuropathy. 
The distribution of ASSEP-behavioral threshold com- 
parisons for children with auditory neuropathy is also 
presented in Fig 2. Unlike the findings for the group 
of children with sensorineural hearing loss, these data 
show only a very weak relationship between the two 
test techniques. Pearson r correlation coefficients 
were only marginally significant, ranging from 0.47 
to 0.67 across the carrier frequencies (see Table 1 
for details). 


The ASSEP-behavioral threshold difference val- 
ues were also significantly higher for children with 
auditory neuropathy than for the sensorineural group. 
Where ASSEP thresholds were typically obtained at 
a sensation level of approximately 5 dB for the chil- 
dren with sensorineural hearing loss, the mean differ- 
ence levels for children with auditory neuropathy were 


TABLE 3. ASSEP-BEHAVIORAL THRESHOLD 
DIFFERENCE LEVELS FOR EACH CARRIER 
FREQUENCY 








Frequency (Hz) 
Group 500 1,000 2,000 4,000 


SN 6.3+9.2 9 7.2 4.0+8.1 3.44 11.1 
AN 40.8+16.0 41.2+19.4 32.1417.3 34.3415.8 


Data are mean + SD. Ears with either ASSEP or behavioral responses 
absent at maximum levels were not included in analysis. Numbers 
of data points varied with frequency and were as follows for SN 
group: 500 Hz, n = 160; 1 kHz, n = 232; 2 kHz, n = 125; 4 kHz, n= 
131. For AN group, they were 500 Hz, n = 18; 1 kHz, n = 32; 2 kHz, 
n = 7; 4 kHz, n = 32. 











approximately 30 to 40 dB (Table 3). 


DISCUSSION 


The results of this retrospective study show that 
the ASSEP procedure can accurately identify and 
quantify hearing loss in infancy. In each of the 200 
subjects, ASSEP thresholds of 260 dB HL on the ini- 
tial assessment were predictive of significant hear- 
ing loss. 


In children with sensorineural hearing loss, there 
was a strong relationship between thresholds estab- 
lished with ASSEPs in the first few months of life 
and those subsequently determined with conditioned 
audiometric techniques. Pearson product moment 
analyses demonstrated that for subjects with moder- 
ate to profound losses, the ASSEP and behavioral 
thresholds were highly correlated for frequencies 
across the audiometric range, with correlation coef- 
ficients ranging from 0.81 to 0.93. 


Although the relationship between the ASSEP and 
behavioral thresholds in these infants was strong, the 
correlation values were not quite as high as those 
obtained with the same test procedure in our study 
published in 1995.4 This earlier investigation, which 
involved a group of adults and young children with 
various degrees of sensorineural hearing loss, found 
Pearson r values in excess of 0.95 for each of the test 
frequencies. 


There are at least 3 factors that could have con- 
tributed to the slightly weaker correlations in the cur- 
rent study. The first concerns the reliability of the 
behavioral thresholds. In the former study, the sub- 
jects were either adults or older children (mean age, 
29 months) at the time of their audiometric assess- 
ment. The behavioral audiograms for the infants in 
this study were obtained at a much younger age (7 to 
14 months). Although the current results were con- 
sidered to be reliable, and were in most cases repli- 
cated over a number of test sessions, it is probable 
that the age of the subjects added to the variability 
of the findings. Second, the correlation values will 
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have been affected by the reduced range of ASSEP 
levels used in this study. In the former study, the sub- 
jects had hearing levels that spanned the audiomet- 
ric range. The current study only included ears with 
ASSEP thresholds of 260 dB HL. Restricting the 
range of possible ASSEP thresholds will in itself have 
resulted in lower Pearson r values. 


A third factor that is likely to have had an impact 
on the correlation is the time delay between the ASSEP 
and behavioral assessments. Whereas in the former 
study, all of the testing was carried out on the same 
day, the immaturity of the infants at the time of ASSEP 
evaluation (mean age, 3 months) in this investiga- 
tion meant that conditioned audiometric results were 
typically not available until approximately 6 months 
later. As such, the results (which reflect the clinical 
reality of early diagnosis) may have been influenced 
by transient middle ear effects that were not detect- 
able on neonatal impedance or otoscopy, or by pro- 
gressive cochlear losses that altered the audiograms 
between assessments. 


Despite these sources of error, the data indicated 
that for infants with sensorineural hearing loss, ASSEP 
thresholds obtained.in infancy are accurate enough 
to form a basis for hearing aid fitting and early inter- 
vention. Confident prediction of hearing thresholds 
based on ASSEP findings was possible in these cases 
because of the reasonably low variability in the 
ASSEP-behavioral threshold relationship. The degree 
of accuracy with which predictions of hearing level 
can be made is determined by the spread of behavioral 
levels that could correspond to a particular ASSEP 
result. The results (expressed in this study as the SD 
of the difference levels) indicated that the range of 
possible behavioral levels was reasonably small, par- 
ticularly for ears with ASSEP thresholds of 280 dB 
HL, where the SD values for most carrier frequen- 
cies were approximately 5 to 10 dB (Table 2). 


All of the children with sensorineural hearing loss 
in this series were given amplification on the basis 
of their ASSEP results within 1 month of the evoked 
potential assessment (mean age, 4 months). A con- 
servative fitting strategy was used whereby the be- 
havioral threshold was presumed to be at a level 10 
to 15 dB better than the ASSEP threshold. This level 
broadly corresponds to the ASSEP threshold minus 
the mean ASSEP-behavioral difference level (ap- 
proximately 5 dB in this study) and minus 1 SD of 
the difference level (5 to 10 dB). No tolerance prob- 
lems or instances of significant overamplification 
were observed with this technique. 


The results obtained from subjects with profound 
sensorineural hearing loss in this study demonstrate 
the usefulness of the ASSEP procedure for confir- 


mation of cochlear implant candidacy. Cochlear im- 
plants have in recent years become an important treat- 
ment option for adults and young children with se- 
vere to profound hearing loss, and in young candi- 
dates, it is the amount of residual hearing that is the 
primary (audiological) criterion. In ears with residual 
hearing at profound levels (>90 dB HL), the contin- 
uous modulated tones (presented at levels as high as 
120 dB HL) typically elicited steady-state responses 
at presentation levels within 5 dB of audiometric 
threshold, and, as can be seen in Fig 3, were able to 
accurately mirror the audiogram configuration. Fur- 
thermore, ASSEP absence at maximum presentation 
levels was a strong indicator that a child did not have 
residual hearing at useful or amplifiable levels. We 
have previously demonstrated that the absence of the 
click-evoked ABR at maximum levels (100 dB nHL) 
does not necessarily exclude the presence of aidable 
hearing. !° 

The ASSEP findings for the 16 subjects with au- 
ditory neuropathy in this study were significantly 
poorer than those obtained for children with sensori- 
neural hearing loss. Although a response was record- 
able at high sensation levels in all cases, there was 
only a very weak relationship between the ASSEP 
and behavioral threshold measures in these children 
(Table 1). As such, the ASSEP technique appears to 
have little or no predictive value for hearing levels 
in children with auditory neuropathy. This result is 
consistent with reports that the generators of the high- 
rate (>70 Hz):ASSEP may be the same as those that 
produce the late components of the ABR.”:!4 Because 
the absence of the transient ABR is one of the signa- 
ture features of this disorder, it is not surprising that 
the ASSEP should also be degraded in children with 
auditory neuropathy. These findings raise the possi- 
bility that other forms of retrocochlear hearing loss 
or neurodevelopmental compromise in the auditory 
brain stem may also result in a discrepancy between 
ASSEP and behavioral threshold. 


One particular limitation of the automated steady- 
state evoked potential technique is that it cannot dif- 
ferentiate between hearing losses of peripheral (co- 
chlear) origin and those that are the result of neural 
transmission or retrocochlear disorders. Transient 
evoked potential techniques such as the ABR can 
make this determination on the basis of the presence 
or absence of short-latency outer hair cell responses 
(cochlear microphonics), as well as the morphology 
and latency of any remaining components of the neu- 
ral response. ASSEP testing, on the other hand, in- 
volves analysis in the frequency, rather than the time, 
domain and cannot differentiate between preneural 
and neural elements in the electroencephalogram. As 
such, we have found it important to use the ABR and 
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ASSEP techniques in concert in order to obtain ac- 
curate evoked potential assessment of young children. 


CONCLUSION 


This review of the first 200 infants diagnosed with 
permanent hearing loss by the ASSEP procedure at 
the University of Melbourne School of Audiology 
Clinic demonstrates that for children with sensorineu- 





ral hearing loss, the technique offers a means of ac- 
curately quantifying hearing levels. The poor pre- 
diction of threshold in children with auditory neur- 
opathy, however, reinforces the need for converging 
evidence from different measures of auditory system 
function, including otoacoustic emissions, cochlear 
microphonics, and behavioral testing in conjunction 
with ASSEP and other evoked potential tests, in or- 
der to diagnose and treat hearing loss in infancy. 
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ELECTRICAL STIMULATION OF THE AUDITORY NERVE VIA 
COCHLEAR IMPLANTS IN PATIENTS WITH AUDITORY NEUROPATHY 
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Auditory neuropathy (AN) is a term used to describe an auditory disorder in which there is evidence of normal outer hair cell 
function (otoacoustic emissions and/or cochlear microphonics) and poor function of the auditory nerve (absent or highly distorted 
auditory brain stem response starting with wave I). Many of these patients have evidence of generalized peripheral nerve disease, 
leading to an assumption that the peripheral portion of the auditory nerve is the most likely site of lesion. A small group of these 
patients has received cochlear implants, and the majority of them achieve average to above-average performance. Although this 
outcome may seem incongruous with neural disease, average performance by patients with AN may be a result of the reintroduction 
of neural synchrony by electrical stimulation and/or the fact that most deaf patients have poor nerve survival. Although cochlear 
implants are promising for deaf patients with AN, more study of the disorder is needed. 


KEY WORDS — auditory neuropathy, cochlear implant. 


DESCRIPTION OF AUDITORY NEUROPATHY 


The term auditory neuropathy (AN) was first used 
by Starr et al! in a 1996 publication. Auditory neurop- 
athy is a constellation of auditory symptoms that are 
analogous to optic neuropathy, a visual disorder in 
which the retina is intact but optic nerve function is 
impaired. 

Patients with AN have hearing loss, a poorly 
formed or completely absent auditory brain stem re- 
sponse (ABR) regardless of hearing thresholds, evi- 
dence of normal outer hair cell function in the form 
of either normal otoacoustic emissions or present 
cochlear microphonics, and dramatically reduced 
speech perception ability.2 Brain stem auditory re- 
flexes, including the middle ear muscle reflexes and 
olivocochlear reflexes, are absent in these patients 
because of the disruption of the afferent arc of the 
reflex.3 The pure tone hearing loss in patients with 
AN ranges from mild to profound; about one third 
show hearing thresholds in the severe to profound 
range. The loss is generally bilateral, although rare 
cases. of unilateral AN have been described. The au- 
diogram configuration is most often flat or rising, but 
other configurations, including high-frequency losses 
and bizarre, sawtooth-type configurations, have been 
seen. Patients with AN have very poor speech dis- 
crimination, even when the pure tone loss is mild or 
moderate. A poorly formed ABR and poor speech per- 
ception have been linked to poor temporal resolution 


‘in the auditory system that is probably caused by dys- 


synchrony at the level of the auditory nerve.!4.5 


SITE OF LESION 


The site of lesion in AN is difficult to determine. 
The presence of otoacoustic emissions and cochlear 
microphonics indicates that the outer hair cells are 
functioning. However, wave I of the ABR, which is 
generated in the most peripheral portion of the audi- 
tory nerve, is absent in patients who are character- 
ized as having AN. This portion of the auditory nerve 
is surrounded by peripheral myelin. The symptoms 
pinpoint the area of disturbance to either the inner 
hair cells (IHCs), the synaptic junction between the 
THCs and the auditory nerve, or the peripheral por- 
tion of the auditory nerve. 


Harrison®? and others have shown animal models 
in which the IHCs can be selectively damaged by car- 
boplatin or hypoxia while the outer hair cells are left 
intact. These animals showed persistent otoacoustic 
emissions. The ABRs in the chinchillas in these stud- 
ies, however, were not absent, but demonstrated ele- 
vated thresholds reflecting the degree of IHC loss. It 
should also be noted that to our knowledge, only one 
study has described human temporal bone dissection 
that revealed a normal complement of outer hair cells 
with selective IHC damage.’ These findings were con- 
fined to a small number of premature infants, but have 
not been seen in adult specimens. 


In a series from California, Starr? has shown that 
27% of patients with AN have evidence of concomi- 
tant peripheral sensory and/or motor neuropathy. 
When data from adult patients are evaluated as a 
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group, as many as 80% show evidence of peripheral 
neuropathy.’ Consequently, demyelinating and/or ax- 
onal neuropathy are the suspected pathophysiologi- 
cal mechanism of the majority of cases of AN. Sural 
nerve biopsies in 4 of Starr’s patients have shown 
evidence of axonal loss and demyelination and re- 
myelination of many of the remaining axons. Al- 
though the peripheral portion of the auditory nerve 
in patients with generalized peripheral nerve disease 
is the most obvious site of lesion, the true site of 
lesion remains unknown for AN patients without pe- 
ripheral neuropathy. 


The site of lesion has been more precisely shown 
as the primary portion of the auditory nerve in a few 
important examples of human postmortem temporal 
bone histology. Spoendlin!° found a normal comple- 
ment of hair cells with significant loss of auditory 
neurons and spiral ganglion cells in 2 sisters with 
low-frequency hearing loss, poor speech discrimina- 
tion, and a diagnosis of Friedreich’s ataxia. Patients 
with Friedreich’s ataxia and hearing loss have been 
described as having abnormal ABRs for the degree 
of hearing loss,!! which is a hallmark of patients now 
described as demonstrating AN. Nadol!? has likewise 
shown a primary loss of spiral ganglion cells in a 
case of Charcot-Marie-Tooth disease, also known as 
hereditary sensorimotor neuropathy, a disorder found 
in a number of patients who demonstrate symptoms 
of AN.!:!3 We now have postmortem biopsy speci- 
mens of the auditory nerve from | of our patients 
studied for AN. These biopsy specimens show a se- 
vere loss of neurons and demyelination and remyeli- 
nation of the remaining neurons.!4 Analysis of the 
cochleas revealed normal inner and outer hair cell 
survival. 


HOW CAN A COCHLEAR IMPLANT WORK ON A 
- DISEASED NERVE? 


Because of significant temporal processing dys- 
function, we have found that most patients with AN 
receive little or no benefit from traditional amplifi- 


cation, with some notable exceptions.!5 At the time 
of this manuscript preparation (February 2001), at 
least 20 children with AN have received cochlear im- 
plants.!6-18 With rare exceptions,!? the performance 
of these children with their device is good to excel- 
lent. Children with AN who receive a cochlear im- 
plant show reasonable progression of speech percep- 
tion abilities as measured on the Early Speech Per- 
ception Test, for example. In addition, neural re- 
sponse telemetry measures, which evaluate the au- 
ditory nerve compound action potential and growth 
function in response to electrical stimulation with 
an intracochlear electrode, are normal in these pa- 
tients. This has raised the question of how the elec- 
trical stimulation of a potentially diseased nerve could 
produce a meaningful signal, or in fact, any signal at 
all. 


The fact that many patients with AN can benefit 
from cochlear implantation should not be surprising 
when information on cochlear implantation of pa- 
tients with sensorineural hearing loss is reexamined. 
As has been repeatedly demonstrated, most patients 
with severe primary end organ (hair cell) disease have 
retrograde neural degeneration to some degree, and 
these patients are using cochlear implants. Zhou et 
al?! have shown that it is possible to elicit an ABR 
with electrical stimuli in mice with peripheral mye- 
lin disorder affecting the auditory nerve, and Leake 
et al??? have shown that electrical stimulation can 
be beneficial in promoting spiral ganglion cell sur- 
vival. 

The exact nature of the lesions in patients with 
AN cannot be determined while they are alive. How- 
ever, it is clear that many patients with the symp- 
toms of AN have a disease of the auditory nerve and 
yet can benefit from electrical stimulation for audi- 
tion. More study is needed to determine the range of 
success for cochlear implantation in these patients, 
but for now, there is reason to be optimistic regard- 
ing the use of implants in patients with symptoms of 
AN. 
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AUDITORY BRAIN STEM AND MIDBRAIN DEVELOPMENT AFTER 
COCHLEAR IMPLANTATION IN CHILDREN 
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Input to the central auditory system through a cochlear implant promotes psychophysical improvement of auditory skills. How- 
ever, the developmental changes along the pathways have never been characterized in children with hearing loss who use implants. 
We aimed to measure auditory development in such children by using the electrically evoked auditory brain stem response (EABR). 
We made repeated measures of the EABR in 41 nonsedated children with implants before chronic stimulation and after 2, 6, and 12 
months of consistent implant use. The results show that EABRs were present in all of the children even before chronic auditory 
stimulation, and that EABR wave latencies decreased from the time of initial activation throughout the first year of cochlear implant 
use. These findings reflect auditory development to the level of the midbrain as a result of the cochlear implant. The decreasing 
latencies likely reflect decreased neural conduction times at this level, in part because of increased synaptic efficacy. 


KEY WORDS — deafness, developmental plasticity, electrical stimulation, evoked potential, human, objective measure. 


INTRODUCTION 


In this study, we aimed to identify changes in cen- 
tral auditory activity after cochlear implantation in 
children with severe to profound sensorineural hear- 
ing loss. We were specifically interested in the de- 
velopmental plasticity of the system, particularly 
when auditory deprivation occurs during potentially 
critical periods in development. 


Severe to profound hearing loss occurring in early 
development prevents acoustic input from adequately 
stimulating the central auditory system, thus hinder- 
ing normal developmental processes!*4 and causing 
reorganization of the pathways.56 Deficits in neu- 
ronal structure and population size have been found 
at all levels of the pathway, from primary neurons to 
more central areas.”-!? Moreover, disruption of nor- 
mal auditory stimulation during development results 
in distortions of tonotopic maps in the auditory mid- 
brain and cortex.!3-!6 


A cochlear implant can effectively stimulate the 
auditory pathways with electrical pulses despite se- 
vere to profound hearing loss. It has been shown that 
when this type of stimulation is provided in neona- 
tally deafened animals, both normal and abnormal 
developmental changes are promoted along the cen- 
tral auditory pathways.!7-!9 Although such evidence 
of plasticity exists in animal models, the integrity of 
the auditory pathways and the ability of the neurons 
present to respond and develop remains unclear in 


children who experience hearing loss during devel- 
opment of the central auditory system. 


Auditory development in children with cochlear 
implants has been inferred from psychophysical mea- 
sures of speech perception. However, this type of test- 
ing cannot provide direct evidence of central audi- 
tory development, nor can it distinguish between au- 
ditory and receptive aspects of language develop- 
ment. Furthermore, data are limited to children who 
are old enough and have sufficient language levels 
to perform the tests. 


Normal human auditory development has been 
studied with auditory evoked potential recordings. 
Over the first 2 to 3 years of life, responses from the 
auditory brain stem and midbrain show changes such 
as decreasing latencies?” that are perhaps caused by 
increased myelination and improved synaptic effi- 
cacy.*!2? We have found that similar evoked poten- 
tial recordings in children who use implants can be 
made without the use of sedation, so that repeated 
measures can be made after the operation (unpub- 
lished observations). In this study, we used the elec- 
trically evoked auditory brain stem response (EABR) 
to assess changes in auditory activity over the first 
year of implant use in a large cohort of 41 children. 
We asked 1) How soon after implantation does the 
auditory system respond to the stimulation provided 
by a cochlear implant? and 2) Do the central audi- 
tory pathways show developmental plasticity after 
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Fig 1. Typical electrically evoked auditory brain stem responses (EABRs) evoked by cochlear implant electrode at basal end 
of array. EABRs were recorded at decreasing stimulus levels at 4 time points: A) initial device activation, B) 2 months of 
implant use, C) 6 months of use, and D) 1 year of use. Clear ell and eV waves are shown by solid lines, and minimum 





latencies measured at each time point are noted. nC — nanocoulombs. 


chronic electrical stimulation from a cochlear im- 
plant? 


MATERIALS AND METHODS 


Subjects. In this prospective study, our subject co- 
hort consisted of 41 children with severe to profound 
sensorineural hearing loss who had been approved 
for implantation through a rigorous assessment in- 
cluding a Graded Profile Analysis.23 Of these chil- 
dren, 35 had congenital severe to profound hearing 
loss, and the other 6 had acquired severe to profound 
losses caused by meningitis in 4 cases, unknown sud- 
den hearing loss in 1 case, and progressive hearing 


loss in 1 case. Of the 35 children with congenital 
deafness, 23 underwent implantation before 5 years 


_ of age (1 to 4 years), and the other 12 underwent 


implantation at older ages (5 to 14 years). Of the 6 
children with acquired deafness, 5 underwent implan- 
tation less than 2 years after the onset of severe to 
profound hearing loss (at 2 to 4 years of age), and 
the other underwent implantation after 16 years of 
deafness (at.17 years of age). All children underwent 
implantation with either the Nucleus 24 implant or 
the 24 Contour implant. 


Electrically Evoked Auditory Brain Stem Response 
Recording Protocol. Electrically evoked potential re- 


34 Gordon et al, Central Auditory Development in Children With Cochlear Implants 


Wave eV 











-1 0 1 
Time After Device Activation (months) 


A 








2324 6 6 7 8-9 10 11 12 13 
B 


Wave elll 


26 


N 
P 





Latency (ms) 
N 
N 





N 








1.8 
1012 3 4 5 6 7 8 9 10 11 12 13 
Time After Device Activation (months) 


Fig 2. EABR latencies measured over time in same child as in Fig 1. A) Wave eV. B) Wave elll. Both waves clearly decrease 


over time of implant use. 


sponses were recorded with the NeuroScan 4.0 soft- 
ware with a Synamps amplifier. The noninverting 
electrode was placed at Cz (International 10-20 Sys- 
tem), and inverting electrodes were placed on the ear- 
lobes both ipsilateral and contralateral to the im- 
planted ear. Electrode leads passed through an ana- 
log low-pass filter (frequency cutoff, 32 kHz) be- 
fore connecting with the preamplifier. On-line filter 
settings restricted recorded frequencies to a 10- to 
3,000-Hz bandwidth. The recording stimulus con- 
sisted of electrical pulses delivered by an implant elec- 
trode at the basal end of the array at a rate of 11/s. 
The stimulation levels ranged from comfortably loud 
levels to electrophysiological threshold and yielded 
a series of waveforms at each test time. The record- 
ings were made in nonsedated children while they 
were watching a movie either seated alone or on a 
parent’s or caregiver’s lap. 


Data Collection. Each child was seen several times 
over the first year of implant use. The first recording 
was made shortly after initial activation, usually on 
the day the device was activated and always before 
chronic stimulation. Repeated recordings were made 
after 1, 6, and 12 months of implant use. Data pre- 
sented in this paper were collected from 41 children 
followed for at least 2 months (15 were followed for 
l year, 17 for 6 months, and 9 for 2 months). 


Data Analysis. The latency (time after stimulus on- 
set) was measured for waves elll and eV for each 
waveform recorded within the range of stimulus lev- 
els. The minimum latency obtained for each series 
of waves at each test time was used for statistical 
analysis. Repeated-measures analyses of variance 
were performed on elll and eV latency data from the 
31 children who were followed for at least 6 months. 


RESULTS 


Anexample of EABR waveforms at all 4 test times 


over the first year of implant use is shown in Fig 1. 
These responses were obtained in a child with con- 
genital severe to profound sensorineural hearing loss 
who underwent implantation at 3.5 years of age and 
was followed for | year. The waveforms seen here 
are typical of those collected in all of the subjects. 
At each test time, both peaks elII and eV are clear at 
high stimulus levels and become smaller in ampli- 
tude as the stimulus level decreases, until no response 
can be seen. There is only a slight fluctuation in peak 
latencies; the minimum latency measured is noted. 
Replicable EABR waveforms were obtained in all 
41 children at initial device activation (ie, before 
chronic stimulation), as well as at later time points. 


A close look at the waveforms shown in Fig | re- 
veals that both eIl and eV latencies decrease over 
time after device activation. Figure 2 depicts the de- 
crease in both elll and eV latencies for the same child 
as in Fig | as a function of duration of implant use. 
These results were typical of those found in other 
children. Averaged data of the entire subject cohort 
appear in Fig 3. Figure 3A shows clear decreases in 
eV latency over the time of implant use. The latency 
of wave elll, shown in Fig 3B, also shows a decrease 
over time. Figure 3C illustrates the decrease in inter- 
wave latency of elll-eV over time. 


The latency changes, with time after implantation, 
of waves eV and elll and the interlatency changes of 
elll-eV were statistically significant (p < .0001, p < 
.05, and p < .0001, respectively). Significant changes 
were found even between the first time point and the 
second, 2 months after device activation (post hoc 
testing at p < .05). 


DISCUSSION 


Our finding of clear EABR waveforms in all chil- 
dren upon initial activation of the cochlear implant 
(before chronic stimulation) indicates that the func- 
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Fig 3. Over time of implant use, clear decreases are shown 
in A) mean EABR eV latencies, B) mean EABR elll la- 
tencies, and C) mean EABR elll-eV interlatencies. Re- 
peated-measures analyses of variance show that these 
changes are significant (p < .0001 in A, p < .05 in B, and 
p < .0001 in C) even after only 2 months of implant use 
(post hoc testing at p < .05). 


tion of the auditory brain stem and midbrain is main- 
tained despite auditory deprivation during potentially 
critical periods in development. Nadol® reported that 
in humans, spiral ganglion survival is particularly 
poor when hearing loss is congenital or caused by 
bacterial meningitis (mean cell survival counts ap- 
proximately 45% of normal mean), whereas better 
neural survival occurs with other causes of hearing 


loss (ie, approximately 80% survival in the case of 
aminoglycoside ototoxicity). Moreover, Nadol found 
that degeneration was most severe at the basal end 
of the cochlea. In the present study, we found that 
despite the likelihood of compromised spiral gangli- 
on integrity in children with severe to profound hear- 
ing loss, the electrical stimulation provided by a basal 
cochlear implant electrode successfully evokes syn- 
chronous auditory activity, at least at the level of the 
brain stem and midbrain. Moreover, this organized 
pattern of response is elicited before chronic electri- 
cal stimulation. 


In previous work, we showed that the maximum 
amplitudes of EABRs recorded in children and adults 
with implants were larger than auditory brain stem 
response amplitudes evoked by an acoustic click 
stimulus in normal-hearing subjects (unpublished ob- 
servations). We suggest that neural synchrony, needed 
for evoked potential recordings, is likely greater in 
adults and children with implants than in normal- 
hearing individuals, because the auditory nerve is di- 
rectly stimulated with a rapid-onset electrical pulse. 
Thus, even if there is a smaller number of stimulated 
neurons available, this increased synchrony of activ- 
ity may allow aclear EABR response to be recorded. 


Whatever deficits exist along the auditory path- 
ways in children with hearing loss before implanta- 
tion, we have shown here that the auditory brain stem 
and midbrain are capable of developmental changes 
with ongoing electrical stimulation. After chronic 
stimulation, the neural generators of waves elI and 
eV fire sooner after stimulus presentation, as shown 
by significant latency decreases of eIl and eV. More- 
over, the pathway from generators of elll to more 
central neurons that contribute to eV becomes faster, 
as shown by decreasing interwave latencies after im- 
plantation. 


Decreasing neural conduction times, as shown by 
decreasing auditory brain stem response wave and 
interwave latencies, are well documented in children 
with normal hearing over the first 2 to 3 years of 
life.20.24.25 The underlying mechanisms are thought 
to include an increase in myelination and improved 
synaptic efficacy of the auditory pathways, at least 
to the level of the midbrain.*!.22.26.27 Our findings of 
reduced latencies may also be a result of similar types 
of developmental changes. 


It is possible that some degree of myelination oc- 
curs in children with severe to profound deafness be- 
fore implantation, as evidenced by studies in animal 
models that showed myelin formation occurring be- 
fore the onset of cochlear function.2® However, in 
the absence of stimulation, the auditory nerves are 
subject to degeneration,”!* which would likely in- 
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clude disruption of any established myelin cells. 
Stimulation of new myelin formation may be depen- 
dent on the type of input provided.29-3! Thus, the in- 
troduction of electrical stimulation of the auditory 
pathways in children who use cochlear implants may 
promote myelination and/or minimize further mye- 
lin depletion. 


Stimulation from the cochlear implant may also 
promote changes at a synaptic level. It is known that 
in normal development of the auditory and other sys- 
tems, synaptic changes along the pathways allow for 
more efficient relay of information from one nerve 
to another.*? The underlying mechanisms include 
cell apoptosis??? and modifications of the cell recep- 
tors.°+-36 For example, the slow-acting NMDA (N- 
methyl-D-aspartate) receptors, predominant in early 
development, may take a secondary role to faster- 
acting AMPA (a@-amino-3-hydroxy-5-methy]-4-isox- 
azolepropionate) receptors, which are dominant in 
adulthood.3’ Hebbian principles suggest that these 
changes are unlikely to occur in the absence of stimu- 
lation. 


It should be noted that electrical stimulation of the 
auditory nerve through a cochlear implant is a unique 
type of input to the system. The auditory system has 
been shown to produce abnormal cortical frequency 
maps!° and broader frequency tuning of the eighth 
nerve and inferior colliculus?®3° in response to un- 
usual inputs presented chronically during develop- 
ment. Therefore, the reductions in neural conduction 
time that we have shown in children who use co- 
chlear implants could potentially result from both 
normal and abnormal processes. 


Although we have shown that the auditory path- 





ways to the level of the midbrain are functional in 
children with sensorineural hearing loss, at least half 
of these nerves are probably subject to degeneration 
during the period of auditory deprivation. Future 
work with electrically evoked potentials should fo- 
cus on what effects prolonged periods of auditory 
deprivation have on how the surviving nerves re- 
spond and develop in response to electrical stimula- 
tion from a cochlear implant. In this way, we might 
better understand why children who undergo implan- 
tation at younger ages tend to achieve higher levels 
of speech perception than their older peers. 


CONCLUSIONS 


Using electrically evoked potential recordings, we 
have shown that an organized pattern of central au- 
ditory activity to the level of the midbrain is evoked 
by electrical stimulation from a cochlear implant in 
children. Moreover, these pathways show develop- 
mental changes with chronic stimulation provided 
by ongoing use of the implant. On the basis of these 
findings, we suggest that auditory activity and devel- 
opment in children with cochlear implants can be 
objectively measured in a noninvasive way, without 
the use of sedation, and in children of a wide range 
of ages. The development of objective outcome mea- 
sures in the pediatric cochlear implant population 
could potentially aid assessment of new device tech- 
nologies, monitoring of progress even in very young 
children, and identification of factors that contribute 
to outcome. Moreover, evoked potentials provide evi- 
dence of central auditory activity through which the 
developmental plasticity of the human central audi- 
tory system can be further defined in children with 
implants. 
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The goal of this study was to determine whether there is a sensitive period during early development when a cochlear implanta- 
tion can occur into a minimally degenerate and/or highly plastic central auditory system. Our measure of central auditory deprivation 
was latency of the P1 auditory evoked potential, whose generators include auditory thalamocortical areas. Auditory evoked poten- 
tials were recorded in 18 congenitally deaf children who were fitted with cochlear implants by 3.5 years of age. The P1 latencies of 
the children with implants were compared with the P1 latencies of their age-matched peers with normal hearing. There was no 
significant difference between the P1 latencies of the children with implants and the children with normal hearing. The present 
results suggest that early implantation occurs in a central auditory system that is minimally degenerate and/or highly plastic. Studies 
are ongoing to assess the consequences to the developing central auditory system of initiating electrical stimulation at later ages. 


KEY WORDS — auditory evoked potential, cochlear implant, development, P1 response, sensitive period. 


INTRODUCTION 


The effects of sensory deprivation caused by con- 
genital deafness on the development of human audi- 
tory pathways are not well understood. Because con- 
trolled deprivation studies in human infants are un- 
ethical, the time course of human central auditory 
system development in the absence of sound is un- 
known. However, the advent of cochlear implanta- 
tion has allowed researchers to initiate afferent stimu- 
lation of auditory pathways in children who have 
been deaf since birth. Ponton et al!.? and Eggermont 
et al’ have examined latencies of the P1 auditory 
evoked potential in children and adults with cochlear 
implants to infer the maturational status of the cen- 
tral auditory pathways in the presence of auditory 
deprivation. 


The P1 response reflects activity from auditory thal- 
amic and cortical sources.*> Because the latencies of 
the Pl component of cortical evoked potentials re- 
flect the sum of transmission and synaptic delays 
along peripheral and central auditory pathways, these 
latencies can provide a measure of the effects of au- 
ditory deprivation on the pathways. Ponton et al! ex- 
amined the latencies of the P1 auditory evoked poten- 
tial in normal-hearing children and adults and in chil- 
dren and adults with cochlear implants. For the chil- 
dren with implants, the average duration of deafness 
was 4.5 years. The children were also subdivided into 
those who had experienced short (1.1 years), medium 


(4.9 years), and long (8.5 years) periods of depriva- 
tion. The investigators reported that although the over- 
all latencies of the P1 response were delayed in chil- 
dren with cochlear implants, the rates of maturation 
for the P1 latency in these children were the same as 
those in normal-hearing children. On the basis of their 
criterion for maturation of the P1 (ie, 15 years), they 
extrapolated that mature latencies would not be 
reached until the ages of 17, 20, and 25.5 years for 
children with implants who had short, medium, and 
long periods of deafness. 


The same group reported that the average time 
to the maturation of P1 latencies was delayed by an 
additional 5 years for a group of children with im- 
plants whose mean duration of deafness was 4.5 
years. Overall, that group surmised from their data 
that the maturational delays in reaching adult P1 la- 
tencies correspond closely with the mean duration 
of auditory deprivation before implantation superim- 
posed on the normal maturational time frame. On the 
basis of these findings, Eggermont et al? suggested a 
model of auditory development in the presence of 
deprivation. They suggested that the auditory sys- 
tem does not mature without stimulation. Deafness 
“freezes” the synapses in an immature state, and co- 
chlear implantation “thaws” the system out of its fro- 
zen state. after which it resumes its normal matura- 
tional sequence.* According to this view, central au- 
ditory system maturation starts after electrical stimu- 
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lation via a cochlear implant is initiated. 


On the other hand, research on the effects of audi- 
tory deprivation on the development of auditory path- 
ways in a variety of animal models indicates that the 
degenerative effects of deafness take time to develop. 
These studies show that rudimentary cochleotopic 
tuning and basic temporal response properties in the 
inferior colliculus and primary auditory cortex de- 
velop in the absence of auditory input.*9 Thus, audi- 
tory pathways must begin to develop normally in the 
absence of sound stimulation. Ryugo’s group!®-12 
have shown a morphological basis for this idea. They 
examined the development of the end-bulbs of Held 
in the cochlear nucleus (which are axosomatic and 
myelinated auditory nerve fibers that end in spheri- 
cal bushy cells) in congenitally deaf white cats and 
congenitally deaf Shaker-2 mice. They reported that 
although the end-bulbs of Held in deaf animals be- 
gin to develop normally, by young adulthood they 
exhibit major alterations in synaptic development that 
are clearly distinguishable from the end-bulbs in nor- 
mal-hearing littermates and that would impair the 
timing resolution of synaptic transmission. On the 
basis of these animal studies that suggest that degen- 
erative changes in sensory pathways take time to de- 
velop, in combination with studies that show that 
young nervous systems exhibit a high degree of neu- 
ral plasticity,!>14 we may speculate that implanta- 
tion before or just after the onset of these degenera- 
tive changes may not result in lasting delays in matu- 
ration of the central auditory system. 


Our goal was to determine whether in humans there 
is a sensitive period during early development dur- 
ing which an implant can be placed into a minimally 
degenerate and/or highly plastic central auditory sys- 
tem, or, conversely, whether central auditory matura- 
tion will show lasting delays even if an implant is 


introduced at an early age. To answer this question, 
we examined the development of P1 latencies in pre- 
lingually deaf‘children who underwent implantation 
relatively early in life, ie, before the age of 3.5 years. 


MATERIALS AND METHODS 


Subjects. Cortical auditory evoked potentials were 
recorded from 18 prelingually deaf children with co- 
chlear implants ranging in age from 1.9 to 6 years 
(mean age, 3.8 years). The age of fitting with the 
cochlear implant ranged from 15 months to 3.5 years 
(mean age, 2 years). Experience with the implant 
ranged from 4 months to 4.1 years (mean, 1.8 years). 


Our comparison group comprised 18 normal-hear- 
ing children spanning roughly the same age range as 
the group with cochlear implants (see Figure, inset). 
The normal-hearing children ranged in age from 2.4 
to 4.7 years (mean age, 3.5 years). None of the nor- 
mal-hearing subjects had a history of neurologic, hear- 
ing, speech, or language problems. Our data were 
taken from a larger data set of 124 normal-hearing 
children in whom we examined the development of 
the P1 response, !5.16 


Stimulus Presentation. Cortical auditory evoked 
responses were recorded in response to a synthesized 
speech syllable /ba/. The duration of the speech sound 
was 90 ms, and it was presented at an interstimulus 
interval of 500 ms. A full description of the stimulus 
has been published.!> The stimulus was presented 
via a loudspeaker placed at an angle of 45° to the 
right of the normal-hearing subjects. For the subjects 
with implants, the speaker was placed on the im- 
planted side. The processors were set to the children’s 
usual settings. 


Evoked Response Recording Procedures. The sub- 
jects were seated comfortably in a reclining chair 
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placed in a sound booth. The younger children were 
seated on a parent’s lap. The subjects watched a vid- 
eotaped movie or cartoon of their choice on a televi- 
sion monitor placed in front of them in the sound 
booth. The videotape audio levels were kept below 
45 dB sound pressure level. We have found this to be 
an effective way of engaging young subjects. Evoked 
potentials were collected with Cz as the active elec- 
trode. The reference electrode was placed on the right 
mastoid, and the ground on the forehead. Eye move- 
ments were monitored with a bipolar electrode mon- 
tage (lateral outer canthus referenced to superior outer 
canthus). For children with implants, the reference 
electrode was placed on the nonimplanted ear, and 
the eye-blink electrode was placed on the nonim- 
planted side. 


Averaging was automatically suspended by the re- 
cording computer when eye blinks were detected. 
The recording window included a 100-ms prestimu- 
lus time and a 600-ms poststimulus time. Incoming 
evoked responses were analog-filtered from 0.1 to 
100 Hz. Approximately 300 response sweeps were 
collected for each subject. The test session, includ- 
ing the electrode application and the evoked response 
recording, lasted for about 30 minutes. 


Data Analysis. Sweeps greater than 100 uV or less 
than —100 LV were rejected offline; then, the remain- 
ing Sweeps were averaged to compute a grand aver- 
age waveform for individual subjects. P1 was de- 
fined as the first robust positivity in the auditory 
evoked potential waveform in the 50- to 175-ms 
range. Latency values were computed for P1 in indi- 
vidual subjects. 


RESULTS 


We have previously shown that the P1 latency de- 
creases in a significant and complex manner from in- 
fancy to adolescence in normal-hearing children. !5-16 
Consistent with this trend, in the present study, the P 1 
latencies of the group with cochlear implants showed 
an overall age-related decrease (r = —0.5; p < .05). 
Overall, children with cochlear implants showed well- 
formed P1 responses. A t-test revealed that there was 
no significant difference in the P1 latencies between 
the normal-hearing children and the children with co- 
chlear implants (t = 0.5; p = .6); another t-test showed 
no significant difference between the ages of the two 
groups (f = 1.03; p = .3). The P1 latencies and age 
distributions of the two groups are shown in the Fig- 
ure. 


DISCUSSION 


Our results show that when afferent stimulation is 
restored to children who have experienced less than 


3.5 years of auditory deprivation, the P1 cortical 
evoked potential evidences normal latencies within 
6 months of implant use. On the basis of this result, 
we suggest that early implantation occurs in a cen- 
tral auditory system that is minimally degenerate and/ 
or highly plastic. 


On the basis of our measure of auditory depriva- 
tion and development (ie, the latency of the P1 evoked 
potential), and given that the children in our sample 
were prelingually deaf, we can entertain two specu- 
lations to explain our results. One possibility is that 
central auditory pathways begin developing normally 
and remain minimally degenerate after periods of au- 
ditory deprivation lasting up to 2 to 3 years. We note 
that this period corresponds closely to the period of 
intrinsically driven synaptogenesis, or synaptic pro- 
liferation, in the auditory cortex.'? Perhaps synapto- 
genesis protects central pathways from measurable 
effects of deprivation. 


It is also possible that early implantation occurs 
in a highly plastic system in which the effects of dep- 
rivation are overcome in a relatively short period of 
time. There is evidence for this possibility from the 
visual system. Maurer et al!4 assessed visual acuity 
in human infants who were congenitally deprived of 
patterned visual input by cataracts. The cataracts were 
removed at | week to 9 months of age. Maurer et al 
found that acuity improved rapidly, with some im- 
provement apparent after as little as | hour of visual 
input. Furthermore, their data suggest that the rate 
of development of visual acuity after cataract removal 
was significantly greater than normal. We cannot say, 
on the basis of our data, whether an analogous pro- 
cess occurs in the auditory system after cochlear im- 
plantation. 


It is interesting to consider our results in the light 
of previous studies by Ponton’s group,! who have 
suggested that there is a maturational delay in P1 
latency for children with cochlear implants. The chil- 


‘dren in that group’s studies had a mean duration of 


deafness on the order of 4.5 years. Consistent with 
results from Ponton’s group, preliminary data from 
our laboratory show that some children, but not all 
children, who have been deaf 4 years or more show 
longer-than-normal cortical response latencies. Those 
findings, considered together with our present find- 
ing that children who undergo implantation by 3.5 
years of age demonstrate age-appropriate cortical re- 
sponse latencies, suggest that implantation performed 
before 3 to 4 years of age occurs in a minimally de- 
generate central auditory system. Studies are ongo- 
ing in our laboratory to better define the time course 
of the plasticity of the central auditory system fol- 
lowing implantation. 





4) 
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The minimum age for cochlear implantation has been reduced to 12 months in an effort to provide auditory stimulation to 
children with hearing loss during early development. Because behavioral measures in such young children are limited, objective 
measures such as the electrically evoked compound action potential (EAP) from the auditory nerve are needed to facilitate measure- 
ment of stimulation level requirements. We assessed EAPs recorded by the Nucleus 24 neural response telemetry (NRT) system in 
children who underwent implantation between 12 and 24 months of age. We recorded EAPs in 37 such children (mean age at implan- 
tation, 18.1 + 3.6 months). The EAPs were of large amplitude, and thresholds fell between behavioral T and C levels. A correction 
factor applied to EAP thresholds provided useful predictions of T levels. The EAPs can be used to ensure that even very young children 
receive auditory stimulation with their cochlear implants upon device activation. 


KEY WORDS — auditory nerve, child, cochlear implant, compound action potential, deafness, developmental plasticity, elec- 


trical stimulation, human, infant, objective measure, spiral ganglion. 


INTRODUCTION 


In this study, we assessed the usefulness of elec- 
trically evoked auditory nerve compound action po- 
tentials (EAPs) recorded by the Nucleus 24 neural 
response telemetry (NRT) system in children between 
12 and 24 months of age who use a cochlear im- 
plant. We asked whether EAPs could be recorded in 
this population by means of the NRT system and as- 
sessed how to predict behavioral measures of stim- 
ulation level requirements from the EAP responses. 


Children with severe to profound sensorineural 
hearing loss are able to receive auditory stimulation 
from electrical pulses delivered by a cochlear im- 
plant. With ongoing use of this device, such children 
are able to develop perception of speech to varying 
degrees.!-3 It has been shown that children with pre- 
lingual deafness achieve higher levels of speech per- 
ception skills as the age at implantation decreases.+-7 
These psychophysical findings suggest the presence 
of critical periods in auditory plasticity.8-? Because 
of this information, candidacy criteria for implanta- 
tion have been expanded to include children between 
12 and 24 months of age. 


The cochlear implant population between 12 and 
24 months of age provides both surgical and postop- 
erative challenges. Once such an infant receives a 
cochlear implant, appropriate stimulation levels must 


be determined for the device to be programmed. The 
program (MAP) is composed of threshold levels (T 
levels) and comfortably loud levels (C levels) at each 
active electrode. The child’s progress must also be 
monitored. In older implant users, psychophysical 
measures are usually used both to establish optimal 
MAP levels and to monitor progress. In very young 
children, however, psychophysical measures might 
not be easily obtained. Objective measures, including 
evoked potential recordings (ie, the electrically 
evoked auditory brain stem response [EABR]),!0!4 
stapedius reflex measures, !5-!6 and EAPs,!0:!7-29 have 
been shown to reliably predict behavioral measures 
of stimulation requirements in both children and 
adults, and thus may be useful tools for managing 
infants with implants. However, some degree of co- 
operation is required for these tests when they are 
performed after the operation. We have found that 
EABRs can be recorded in awake children (unpub- 
lished observations), but this technique requires place- 
ment of cephalic surface electrodes and limited move- 
ment of the child. Stapedius reflex measurements re- 
quire the child to sit relatively quietly with a probe 
in his or her ear. In comparison, the EAP measured 
by the Nucleus 24 NRT system requires only the stan- 
dard programming equipment setup (the child wears 
his or her external equipment, which is interfaced to 
the programming computer) and is less susceptible 





From the Cochlear Implant Program, Department of Otolaryngology, The Hospital for Sick Children and the University of Toronto, Toronto, 
Canada (Gordon), the Houston Ear Research Foundation, Houston, Texas (Gilden), and the Department of Otolaryngology, New York Univer- 
sity Medical Center, New York, New York (Shapiro). Ms Ebinger is employed by Cochlear Corporation, Denver, Colorado. She was a Senior 
Clinical Studies Specialist when this article was submitted and is currently a consultant for the company. The EAP data from the Hospital for 
Sick Children were collected as part of a study funded by the Physicians’ Services Incorporated Foundation of Ontario. 


CORRESPONDENCE — Karen A. Gordon, MA, Cochlear Implant Laboratory, Room 6D08, The Hospital for Sick Children, 555 University Ave, 


Toronto, ON, Canada, MSG 1X8. 


42 





k 











Gordon et al, Neural Response Telemetry in Infants 43 


to myogenic artifacts, including vocalizations, than 
either EABR or stapedius reflex tests. The EAP may 
therefore be the most effective objective measure to 
use in infants with implants. 


The Nucleus 24 NRT system has been shown to 
provide reliable EAP responses in adults?! and chil- 
dren (mean age, 6 years 7 months).!7 However, there 
have been no reports of NRT responses in children 
who underwent implantation between 12 and 24 
months of age. In this study, we asked 1) Can the 
Nucleus 24 NRT system record EAPs in children who 
undergo implantation between 12 and 24 months of 
age? 2) Do these measures differ from those in adults? 
and 3) Can these measures be used to predict behav- 
ioral measures of stimulation levels? 


METHODS 


Subjects. The EAP recordings were made in 37 
children who underwent implantation with Nucleus 
24 devices between 12 and 24 months of age (mean 
age, 18.1 + 3.6 months). Each child underwent im- 
plantation and was followed at one of the following 
implant centers: The Hospital for Sick Children, To- 
ronto; the Houston Ear Research Foundation; the 
New York University Medical Center; the Univer- 
sity of Iowa; or the University of Michigan Medical 
Center. Three of the 37 children underwent implan- 
tation on the left side, and there were 20 girls and 17 
boys. Interestingly, in many cases (n = 12), the cause 
of hearing loss was known, which may explain why 
diagnosis and intervention of these infants occurred 
despite the absence of a universal screening program 
in 3 of the 5 implant center regions. 


Neural Response Telemetry Recordings. The EAPs 
were recorded with the Nucleus 24 NRT system. Re- 
sponses were evoked by 3 electrodes in.each child: 1 
electrode at the apical end of the array (E20 in most 
cases, or E22 or E17), 1 electrode at the basal end 
(E3 in most cases, or E5), and 1 electrode in the mid- 
dle of the array (E9 or E10 in most cases, or E12, 
E13, or E15). The recordings were made after oper- 
ation, with the exception of 3 children in whom re- 
sponses were obtained immediately after device in- 
sertion in the operating room. The mean duration of 
implant use at the time of recording was 8.4 months. 
The postoperative recordings lasted 30 to 90 min- 
utes, during which time the child, seated on a par- 
ent’s or caregiver’s lap, either watched a videotape 
or played with toys. 


The NRT uses a forward masking paradigm in 
which a probe pulse is delivered at a specified delay 
to the masker in order to take advantage of the neu- 
ral refractory period. The response to the probe in this 
condition is subtracted from the response to the probe 
alone in an effort to eliminate the stimulus artifact. 


This technique is described in detail elsewhere. !0 


All responses were recorded via a monopolar mode 
of stimulation; the stimulating electrode pair com- 
prised an intracochlear electrode referenced to the ex- 
tracochlear ball electrode placed under the temporalis 


‘muscle (MP1), and the recording pair comprised an 


intracochlear electrode 2 electrode locations apical 
to the intracochlear stimulating electrode referenced 
to the extracochlear plate electrode located on the re- 
ceiver-stimulator (MP2). Electrical pulses were de- 
livered at a rate of 80 Hz, and the masker advance 
(time between masker and probe) was 500 us. The 
pulses were 25 us in duration both for EAP and be- 
havioral measures, with the exception of 1 child who 
required a pulse width of 50 to 100 us for both. Ini- 
tial recordings were made with an amplifier gain of 
60 dB and a sampling delay of 50 us. If these set- 
tings did not yield an identifiable response, an opti- 
mizing program was run in which the stimulus level 
remained constant for both masker and probe, the 
amplifier gain was either 60 or 40 dB, and the sam- 
pling delay varied from 50 to 125 us. The settings 
that provided the clearest response were then used. 


The stimuli were presented at comfortably loud 
levels. These levels were determined by presenting 
ascending stimulus levels and watching for any be- 
havioral signs of discomfort from the child. The max- 
imum level at which the child was comfortable was 
used for recording. The probe pulses were attenuated 
in 5 stimulating unit steps while the masker pulses 
were held either 10 stimulation units above the probe 
level or at a constant level in order to obtain an am- 
plitude growth function and to find the threshold of 
response. The threshold was determined as the mini- 
mum probe level that produced a visible response. 
This method has been shown to correlate strongly 
with objective measures of threshold estimation.!7 


A typical EAP, shown in Fig 1A, consists of a nega- 
tive amplitude peak, termed N1, often followed by a 
positive amplitude peak, P2. Each peak was mea- 
sured for latency and amplitude. In cases in which 
no positive peak was visualized, as shown in Fig 1B, 
the point at which the response tended toward base- 
line after the N1 was chosen as P2. Response ampli- 
tudes were measured as the absolute difference be- 
tween the N1 and P2 amplitudes. The maximum am- 
plitude of N1-P2 and the minimum N1 latency ob- 
tained within the recording range were used for sta- 
tistical analyses. 


Behavioral Measures of Threshold and Comfort- 
ably Loud Stimulation Levels (MAP Levels). Behav- 
ioral measures of required stimulation levels made 
within 3 months of EAP recordings for correspond- 
ing stimulating electrodes (1 at the apical end, 1 in 
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Probe Current Level 


Fig 1. Electrically evoked auditory nerve com- 
pound action potentials (EAPs) recorded with 
Nucleus 24 neural response telemetry (NRT) 
system with decreasing probe current level. 
EAP threshold, indicated in box, is last probe 
level at which response can be visualized. A) 
Clear NI and P2 waves are indicated by long 
and short dashed lines, respectively, in child 
who underwent implantation at 12 months of 
age. EAP amplitude is measured as absolute 
difference between N1 and P2 amplitudes. B) 
Clear N1 is indicated by long dashed line in 
child who underwent implantation at 18 months 
of age. No clear P2 is seen. Solid line indicates 
point at which response returns to baseline af- 
ter NI. Amplitude at this point is used as N2 
amplitude for purpose of EAP amplitude mea- 
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the middle, and 1 at the basal end of the array) were 
used for statistical analyses. Threshold stimulation 
levels (T levels) were measured with either condi- 
tioned play or behavioral observation audiometry by 
audiologists experienced in these techniques. Com- 
fortably loud levels (C levels) were determined by 
increasing T levels by 15 to 40 units and ensuring 
that these levels did not cause any negative behav- 
ioral responses from the child either on individual 
electrodes or when all electrodes were active together. 
The dynamic range between T and C levels tended 
to be increased as the child experienced longer dura- 
tions of implant use. The median stimulus MAP rate 
was 900 Hz, and the range was 250 to 1,200 Hz. 


Predicted T Levels. Predicted T levels were deter- 
mined by the method described by Hughes et al.!7 In 
this method, EAP thresholds measured by NRT are 
corrected by a factor calculated by subtracting the be- 
havioral T level obtained at 1 electrode in the middle 
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Fig 2. Mean EAP N1-P2 amplitudes for both adults and 
infants are shown for 3 electrodes. EAP amplitudes were 
significantly larger in infants than in adults for basal, mid- 
dle, and apical array electrodes (p < .0001, p < .01, and p 
< .0005, respectively). In infants, amplitudes were larg- 
est for apical electrode (p < .00005). 
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of the implanted array (ie, E10) from the EAP mea- 
sured at the same stimulating electrode. These inves- 
tigators found that by subtracting this factor from EAP 
thresholds, the correlation between predicted and 
measured T levels was better than that between EAP 
thresholds and either T or C levels. They recom- 
mended using this procedure to predict behavioral 
levels of stimulation in cases in which behavioral 
responses were limited. 


In the present study, EAP correction factors were 
determined by subtracting the behavioral T level mea- 
sured at the middle array electrode from the EAP 
threshold evoked by the same electrode. This correc- 
tion factor was then subtracted from the EAP thresh- 
olds measured at the apical and basal array electrodes. 


Adult EAP Data. The EAPs in 147 adults with post- 
lingual deafness were recorded at least 1 month af- 
ter operation as part of a separate study conducted in 
Europe with the Nucleus 24 NRT system. There were 
81 women and 66 men between 15 and 79 years of 
age who had a maximum of 4 years of implant use 
(ie, underwent implantation as adults or adolescents). 
The duration of deafness ranged from less than | year 
to 60 years. The NRT recording parameters were the 
same as those used in the children (as described 
above). One exception was that the masker level was 
kept at 5 stimulating units above the probe level. 
whereas in children, the masker level was either held 
at 10 stimulating units above the probe level or held 
constant. The active stimulating electrodes included 
a basal electrode (E3), a middle array electrode (E10), 
and an electrode at the apical end of the array (E20). 
The N1-P2 amplitudes and N1 latencies were mea- 
sured as indicated above. 


Data Analysis. The maximum amplitude and mini- 
mum latency differences between the 3 stimulating 
electrodes in children were assessed with repeated- 
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Fig 3. Mean N1 latencies at 3 electrode locations in in- 
fants and adults. Latencies were significantly shorter in 
infants than in adults for middle and basal array elec- 
trodes (p < .01 and p < .001, respectively). N1 latencies 
in both groups tended to be longest for basal electrode. 


measures analysis of variance or Friedman ’s repeated- 
measures analysis of variance on ranks with post hoc 
testing at p < .05 by the Student-Newman-Keuls 
method. Raw adult data were not available, and thus 
similar analyses could not be done. The amplitude 
and latency differences in the children were compared 
to adult mean and standard deviation data by use of 
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Fig 4. Threshold (T) and maximally comfortably loud (C) lev- 
els used in Nucleus 24 programming MAP are plotted against 
EAP thresholds obtained with NRT system for 3 electrodes. In 
many cases, these levels are based on limited behavioral re- 
sponses. Solid black line indicates point at which behavioral 
and NRT levels are equal. T levels tend to fall below this line, 
and C levels tend to fall above. Dashed lines indicate best-fit 
linear regression line for T and C levels separately. R values are 
shown and suggest weak correlation for both T and C levels 
with NRT thresholds. 


t-tests. The behavioral MAP levels were compared 
to the EAP threshold levels and the predicted levels 
by use of linear regression analysis, and the mean 
results were compared by use of paired t-tests or the 
signed rank test. 


RESULTS 


Figure 2 shows the maximum amplitudes of the 
EAP N1-P2 at 3 electrode locations for both the in- 
fant and adult groups. The amplitudes in the infant 
group are largest for the apical electrode (p < .00005). 
The amplitudes in the infants are significantly larger 
than those in the adults for the apical, middle, and 
basal array electrodes (p < .0001, p < .01, and p < 
.0005, respectively). 


Figure 3 shows the minimum latencies of N1 at 
3 electrode locations for both the infant and adult 
groups. The N1 latencies in both groups tended to 
be longest for the basal electrode; however, the laten- 
cy differences between electrodes were not signifi- 
cant in the infant group (p > .05). The latencies were 
significantly shorter in the infant group than in the 
adult group for the middle and basal array electrodes 
(p < .01 and p < .001, respectively). 


Within the infant group, the EAPs were weakly 
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Fig 5. Mean EAP thresholds and MAP T and C levels 
are shown for each of 3 electrodes. EAP thresholds fall 
between T and C levels. 


correlated to the behavioral levels of stimulation at 
all electrodes. Figure 4 shows the individual MAP 
level data with respect to the corresponding EAP 
thresholds at the apical, middle, and basal electrodes. 
The solid line indicates the point at which the MAP 
levels and EAP thresholds are equal. For all elec- 
trodes, the T levels tend to be lower than the EAP 
threshold, and the C levels tend to be higher. The 
mean data, shown in Fig 5, confirm that the EAP 
thresholds fall between the T and C levels. Figure 6 
shows that the mean EAP thresholds fall at 58%, 
61%, and 71% of the dynamic range between the T 
and C levels for the apical, middle, and basal elec- 
trodes, respectively. The mean middle array electrode 
threshold is 61%. 


The data shown in Fig 7!’ confirm that T levels 
predicted using a middle array correction factor pro- 
vide a good estimate of the mean behavioral T lev- 
els. In comparison, the EAP thresholds do not approx- 
imate the T levels. The differences between the mean 
predicted and the mean measured T levels, shown in 
Fig 7, are not significantly different for either the 
apical or basal electrode (p > .05). 
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Fig 6. EAP thresholds are shown as percentage of MAP 
dynamic range (stimulation units between T and C lev- 
els), in which T level represents 0% of range and C level 
represents 100%. 
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Fig 7. Mean EAP thresholds, T levels, and predicted T 
levels are shown for apical and basal array electrodes. 
Predicted T levels were obtained by method described 
by Hughes et al.!’ For each child (n = 37), correction 
factor was determined by subtracting T level of middle 
array electrode from EAP threshold of same electrode. 
This value was then subtracted from EAP thresholds at 
basal and apical array electrodes to give predicted mea- 
sure cf T level for corresponding electrode. 


DISCUSSION 


NRT Recording in Infants Implanted Between 12 
and 24 Months of Age. We found that the Nucleus 24 
NRT system is able to record reliable EAPs in in- 
fants who undergo implantation between 12 and 24 
months of age. We found that no special recording 
techniques were necessary in this population. How- 
ever, w2 have found that in children of all ages, the 
stimulation levels needed for EAP recording may ex- 
ceed comfort limits in the early stages of device use. 
We have tried various methods to ensure more com- 
fortable stimulation, such as providing ascending 
probe stimulation levels rather than descending lev- 
els for growth function recording. Recordings made 
in the operating room while the child is sedated are 
not lim ted by loudness comfort concerns and thus 
may be useful for predicting initial MAP settings. 
Providing suprathreshold stimulation in the first 
months of implant use can allow the child to grow 
accustomed to this new input. The dynamic range 
can subsequently be gradually increased. The NRT 
recordings can be remeasured once the child is more 
at ease with higher stimulation levels. 


Objective Measures of Stimulation Thresholds. 
The aim of performing implantation in children as 
young ës 12 months is to provide auditory stimula- 
tion during early developmental stages. Thus, ensur- 
ing that suprathreshold stimulation is delivered from 
the onset of device activation is essential. Behavior- 
al measures at young ages are limited; responses may 
not be reliable and may not be indicative of thresh- 
old versus comfortably loud levels. These difficul- 
ties may be exacerbated in children with congenital 
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causes of hearing loss, as they may be unfamiliar 
with auditory stimuli. 


We have found that EAPs recorded with the NRT 
system are useful tools in predicting T levels and can 
be used to provide even very young children with 
auditory stimulation from their cochlear implants be- 
ginning on the first day of device activation. These 
initial levels should then be refined over time with 
behavioral measures that will likely become more 
reliable as the child learns to detect, recognize, and 
reliably respond to auditory stimuli. 


Our results show correlations between EAP and 
behavioral thresholds that are lower than those re- 
ported elsewhere in older children and adults. 1017,18 
These findings are likely a result of 2 issues particu- 
lar to the present study. First, the behavioral results 
obtained in infants are less reliable than results from 
older implant users. Second, our MAP stimulation 
rates were higher than those used in previous studies 
and were not the same in all children. Despite these 
issues, we.found that on average, the behavioral mea- 
sures obtained by experienced audiologists showed 
thesame relationship to EAP thresholds as those found 
in older children and adults. The mean EAP thresh- 
old levels were between the mean T levels and the 
mean C levels for all 3 electrodes studied, falling at 
58% to 71% of the dynamic range (C level minus T 
level). We also found that the method of Hughes et 
al!7 for predicting T levels from NRT thresholds re- 
sulted in mean predicted thresholds that were only 
minimally (and not significantly) different from mean 
T levels. Like Hughes et al, we found that EAP thresh- 


olds evoked by basal electrodes tended to fall athigher - 


percentages of the behavioral dynamic range. Thus, 
predicted T levels may be slightly less accurate for 
basal electrodes than for middle array and apically 
located electrodes. This is an area for future study. 


Neurologic Implications. We found that EAP N1- 
P2 amplitudes were larger and N1 latencies were 
shorter in infants who underwent implantation be- 
tween 12 and 24 months of age than in postlingually 
deafened adults who underwent implantation as 
adults or adolescents. The amplitude findings are con- 
sistent with reports of amplitude growth functions 
that were steeper in children with congenital deaf- 
ness than in adults with postlingually acquired deaf- 
ness.” The duration of deafness before implantation 
is usually shorter in infants than in adults; thus, the 
degenerative effects of auditory deprivation might 
be more limited. In very young children, then, elec- 
trical pulses from the cochlear implant could be stim- 
ulating a larger population of functioning spiral gan- 
glia and/or evoking a greater degree of neural re- 
sponse synchrony, perhaps resulting in responses of 


larger amplitude and shorter latency. 


We also found that the EAP N1-P2 amplitudes 
were larger for apical implant electrodes than for ba- 
sal electrodes in children who underwent implanta- 
tion between 12 and 24 months of age. This finding 
is consistent with those of others.!7 Moreover, we 
have previously reported increased EABR ampli- 
tudes and decreased EABR latencies evoked by api- 
cal as compared to basal electrodes (unpublished ob- 
servations). These findings suggest that a smaller pop- 
ulation of neural elements are being stimulated at 
the basal end of the implant array than at more api- 
cal areas of the array, either because of the larger dis- 
tance between the electrode array and the neurons at 
the basal end of the array, or because of a smaller 
population of viable neurons existing in the base of 
the cochlea. The latter theory is supported by ana- 
tomic findings of poorer spiral ganglion survival in 
the base of the cochlea than at more apical areas in 
humans who had sensorineural hearing loss.”3 


CONCLUSION 

The goal of performing implantation in very young 
children is to provide auditory stimulation during 
early development. We must therefore aim to pro- 
vide suprathreshold and comfortable stimulation for 
these children as soon as their implants are activated. 
The EAPs measured with the Nucleus 24 NRT sys- 
tem are useful objective measures of stimulation re- 
quirements in children of all ages, including those 
who undergo implantation between 12 and 24 years 
of age. These measures are relatively easily recorded 
in such children and can be used to predict behav- 
ioral T levels with a correction factor determined with 
behavioral threshold data from only 1 electrode from 
the middle of the array. The gold standard for MAP 
T levels remains accurate behavioral measures that 
can be obtained once the child has learned to detect 
and consistently respond to auditory input. 


Behavioral measures remain essential, as they re- 
flect auditory processing along the central auditory 
pathways, whereas the EAP reflects activity only of 
the primary auditory neurons. In the present study, 
we found that in very young children, as compared 
to adults, electrical stimulation may evoke a synchro- 
nous response from a larger population of spiral gan- 
glia. In infants, we also found the amplitude re- 
sponses from neurons located at more apical areas 
of the implanted array to be greater than those at basal 
areas. 


Future research with EAPs should aim to elucidate 
the effect of electrical stimulation on the primary au- 
ditory neurons over time, and how it relates to behav- 
ioral stimulation requirements and perception of 
sound with a cochlear implant. 
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INFANT COCHLEAR IMPLANTATION AND ANESTHETIC RISK 


NANCY M. YOUNG, MD 
CHICAGO, ILLINOIS 


In light of the strong trends toward performing cochlear implantation in infants, it is necessary to consider anesthetic issues. Just 
as anesthetic risk may play an important role in surgical candidacy in the elderly population, anesthesia is also of special consider- 
ation in infants. Even healthy infånts are known to be at increased risk for anesthetic complications; for this reason, most elective 
surgical procedures are not routinely done within the first year of life. Therefore, it is necessary to consider anesthetic issues when 
contemplating the use of cochlear implants in infants less than 12 months of age. 


KEY WORDS — anesthetic risk, cochlear implant, pediatric anesthesia. 


INTRODUCTION 


Clinical trials required for commercial approval 
of cochlear implant systems by the US Food and Drug 
Administration have demonstrated that implantation 
is relatively safe and effective for children as young 
as 12 months of age. A small but growing number of 
infants younger than 12 months of age have under- 
gone implantation. Recent widespread implementa- 
tion of mandated newborn hearing screening in the 
United States has made the issue of whether to per- 
form implantation in infants one of practical impor- 
tance. Therefore, the time has come to ask whether 
infant cochlear implantation should be performed on 
a routine basis. Although there are many issues to 
consider, the subject of anesthetic risk is of special 
importance when contemplating surgical interven- 
tion during infancy. 


Over the past 100 years, the practice of anesthesia 
has developed into a separate academic specialty that 
has enabled modern-day surgery to flourish. The de- 
velopment of more predictable anesthetic agents, ad- 
vances in monitoring systems, and specialized recov- 
ery and intensive care units have-made possible ad- 
vances in surgery by contributing greatly to improved 
patient outcomes.! Through research, education, and 
training, the current generation of anesthesiologists 
in the United States has systematized anesthetic prac- 
tice in order to improve patient safety. However, de- 
spite these achievements, there is significant evidence 
that infants are at increased anesthetic risk in com- 
parison to older children and adults. 


REVIEW OF LITERATURE 


Several large studies have demonstrated that mor- 
bidity and mortality rates are higher during the first 





12 months of life. A prospective study from France 
examined 40,240 anesthetic cases between 1978 and 
1982 in children younger than 15 years of age, in- 
cluding 2,103 infants.? The incidence of complica- 
tions among infants (4.3 per 1,000) was significantly 
higher than that in older children (0.5 per 1,000; p < 
.001). The complications in infants seen in this study 
were mainly a result of respiratory failure, in con- 
trast to those in older children, who experienced cir- 
culatory failure as often as respiratory failure. 


Keenan et al? studied the incidence of bradycar- 
dia in children younger than 4 years of age who un- 
derwent noncardiac surgery. They found that a high- 
er incidence of bradycardia occurred in infants young- 
er than 1 year of age (1.3%) than in children in the 
second, third, and fourth years of life (0.98%, 0.65%, 
and 0.16%, respectively). The morbidity associated 
with bradycardia in infants was substantial and in- 
cluded hypotension (32%), asystole or ventricular fi- 
brillation (14%), and intraoperative death (8%). Risk 
factors for bradycardia included significant preopera- 
tive medical illness, long versus short surgery, emer- 
gency versus elective surgery, and a supervising non- 
pediatric anesthesiologist versus a pediatric anesthe- 
siologist. This study confirmed that bradycardia dur- 
ing anesthesia is more likely to occur during infancy 
and that it is a significant adverse event, because of 
its associated morbidity and mortality. This was the 
first study to document the potential benefit of using 
pediatric specialists during surgery on infants. 


The same investigators subsequently performed a 
retrospective analysis of the frequency of anesthe- 
sia-related cardiac arrest in infants to determine 
whether the presence of a pediatric anesthesiologist 
had an effect on patient outcomes.* They compared 


From the Division of Pediatric Otolaryngology, Children’s Memorial Medical Center, and the Department of Otolaryngology—Head and Neck 


Surgery, Northwestern University Medical School, Chicago, Illinois. 


CORRESPONDENCE — Nancy M. Young, MD, Section of Otology and Neurotology, Division of Pediatric Otolaryngology, Children’s Memo- 


rial Medical Center, 2300 Children’s Plaza, #25, Chicago, IL 60614. 


50 Young, Infant Cochlear Implantation & Anesthetic Risk 


2,310 patients supervised by pediatric anesthesiolo- 
gists with 2,033 patients supervised by nonpediatric 
anesthesiologists within the same medical facilities. 
Although the two groups of patients were compar- 
able in terms of age, weight, and general health, the 
rates of cardiac arrest between the two groups did 
differ significantly. There were 4 anesthetic cardiac 
arrests in the nonpediatric anesthesiologists’ group 
(19.7 per 10,000); no anesthesia-related cardiac ar- 
rests occurred in the pediatric anesthesiologists’ 
group. 

In order to gain additional insight into anesthetic 
morbidity and mortality risks, since 1985, the Amer- 
ican Society of Anesthesiology has been studying 
closed-claim surgical malpractice cases. A 1993 re- 
port reviewed 2,400 closed claims, of which 283 
(11.8%) involved children younger than 15 years of 
age.° Twenty-eight percent of these pediatric claims 
involved infants younger than 12 months of age, the 
majority of whom were 6 months of age or younger. 
The group of children 6 months of age or younger 
had a larger percentage of claims than any other pe- 
diatric age group. A different distribution of damag- 
ing events was found in the pediatric claims in com- 
parison with the adult claims. The mortality rate was 
greater in the pediatric claims (50%, versus 35% in 
adult claims; p < .01), despite the fact that the pedi- 
atric patients had better general health. The compli- 
cations in the pediatric group were also more fre- 
quently judged to have been preventable, and the care 
was more frequently judged to have been substan- 
dard. The respiratory system was more often involved 
in pediatric claims (43%, versus 30% of adult claims). 
Younger children were more likely to have airway- 
related claims, and the group of infants 6 months of 
age or younger accounted for 20% of these types of 
claims. A significant number of airway management 
problems, such as difficult intubation or unintended 
extubations, occurred in the pediatric claim group. 


In 1994, the Pediatric Perioperative Cardiac Ar- 
rest (POCA) Registry was formed in order to study 
the clinical factors and outcomes associated with car- 
diac arrest in children who receive anesthesia. In 
2000, a report reviewing the cases provided to the 
POCA Registry was published.® The report analyzed 
289 cases submitted by the 63 institutions that had 
voluntarily agreed to provide an anonymous, stan- 
dardized data form for each cardiac arrest that oc- 
curred in a child. Seventy percent of all anesthetic- 
related cardiac arrests were related to medication 
(40%) or cardiovascular causes (30%). Two thirds of 
the medication-related arrests were at least in part 
caused by halothane-induced cardiovascular depres- 
sion. The majority of arrests in children with good 
health (American Society of Anesthesiologists physi- 


cal state I or II) were medication-related. Infants 
younger than | year of age accounted for 55% of all 
anesthetic-related arrests. 


DISCUSSION 


Children are not small adults. This truism certainly 
applies to infants who undergo anesthesia. From a 
physiologic perspective, it may be helpful to think 
of infants as a different species. They have an imma- 
ture respiratory control mechanism. Rising carbon 
dioxide levels, which normally cause hyperventila- 
tion, may cause apnea in infants. Infants also have 
increased oxygen demand and diminished oxygen re- 
serves in comparison to older children and adults. 
Therefore, respiratory compromise can quickly cause 
hypoxemia. 


Infants are born with normal vagal responses, but 
their sympathetic nervous systems have only begun 
to develop. Therefore, they may not elicit the nor- 
mal “fight-or-flight” response to stress. Instead, stress 
from anesthesia may produce a paradoxical decrease 
in heart rate and blood pressure. The contractility of 
the infant heart is also limited, and as a consequence, 
infants are unable to significantly increase stroke vol- 
ume. Therefore, their cardiac output is purely rate- 
dependent. As a result, bradycardia is more worri- 
some in infants, because it is more likely to cause a 
sudden decline in blood pressure and cerebral blood 
flow in infants than in older children capable of dif- 
ferent physiologic responses. 


Infant anatomy also provides for a more challeng- 
ing anesthetic experience. Intravenous access may 
be difficult to achieve. Lack of intravenous access 
severely hampers the anesthesiologist’s ability to 
urgently provide fluids and medication. The infant 
airway can be difficult to control because of its small 
size and the increased likelihood of laryngospasm. 
The anesthesiologist’s job is also complicated by the 
fact that all inhalation agents act as cardiac depres- 
sants in infants. In addition, the dose-response curves 
of these agents are much more variable in infants 
than in older children. Careful titration, instead of 
simple weight-based dosing, is necessary. However, 
successful titration is challenging, because infants ex- 
perience more rapid uptake of inhalational anesthet- 
ics. 


It is important for surgical subspecialists to un- 
derstand that a very small percentage of the total num- 
ber of surgeries performed on children occur in in- 
fants younger than 12 months of age. Traditionally, 
surgery in this age group has been reserved for man- 
agement of urgent conditions and congenital anoma- 
lies in which waiting is not an option or would in- 
crease the likelihood of subsequent complications. 
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Therefore, most anesthesiologists, in their training 
and in their practice, have limited opportunity to gain 
experience with this age group. 


Although even healthy infants are at increased an- 
esthetic risk, certain subpopulations are at greater risk 
than others. Those with more challenging anatomy 
and/or physiology during infancy include preterm in- 
fants,”™-? infants with congenital heart disease as well 
as other anomalies, and those with central nervous 
system disease. Infants without a specific medical 
diagnosis, but who are not growing and thriving, are 
also of concern from an anesthetic perspective. 


CONCLUSION 


Surgical subspecialists need to consider anesthetic 
issues when contemplating cochlear implantation dur- 





ing infancy, because adverse anesthetic events are 
more common in this age group than in older chil- 
dren. Because infants 6 months of age and younger 
appear to be the most likely to experience problems, 
implantation in this age group in the absence of an 
urgent indication may be ill-advised. Infants with 
medical conditions that add to their anesthetic risk 
may not be good surgical candidates for cochlear im- 
plantation during the first 12 months of life. 


The decision to perform implantation in a child 
during infancy needs to be individualized. The in- 
fant’s health and development and the availability 
of anesthesiologists and supportive care units who 
have experience with infants are factors that should 
influence the timing and location of surgical interven- 
tion. 
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ROLE OF INFANT VOCAL DEVELOPMENT IN CANDIDACY FOR AND 
EFFICACY OF COCHLEAR IMPLANTATION 
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Previous research has documented the importance of audition in the development of typical infant vocalization. Of particular 
interest is the development of canonical babbling, which is related to mastery of the timing elements of speech. Children with severe 
to profound hearing loss who use hearing aids have demonstrated both delayed and deviant canonical babbling. The vocal develop- 
ment of 12 children has been followed as they have been considered for cochlear implantation. Nine of these children have under- 
gone implantation, and 5 of these children have been followed for more than 1 year after implantation. On average, canonical 
babbling emerged at 6.5 months after implantation. The time frame in which some children developed words was accelerated in 
terms of length of auditory experience as compared with normal-hearing peers. Mature oral-motor development is likely the primary 
contributing factor in this time course. One child who received a cochlear implant began to babble with hearing aids. It is not known 
whether normal canonical babbling is sufficient evidence on which to base candidacy decisions; however, our data suggest that 


children who produce canonical babbling maintain and build upon those skills after implantation. 


KEY WORDS — canonical babbling, cochlear implant, hearing aid, vocal development. 


Children with bilateral severe to profound sensori- 
neural hearing loss as young as 12 months of age are 
now being considered candidates for cochlear im- 
plantation. The current candidacy guidelines for chil- 
dren for US Food and Drug Administration—approved 
devices stipulate that children must also show little 
or no benefit from amplification and show limited 
development of auditory skills.! This lower age at 
which parents can choose implantation for their child 
presents new challenges to professionals who work 
with these children to document their candidacy by 
the child’s first birthday. Although electrophysiologi- 
cal and behavioral test measures for infants can quan- 
tify the degree of hearing loss, it remains a challenge 
to document whether a child can use his or her aided 
residual hearing to develop speech, language, and au- 
ditory skills. Linguistic skills vary widely at this age, 
not only in children with hearing loss, but even in 
normal-hearing children. 


The use of infant vocal behaviors as one tool in 
the determination of implant candidacy and efficacy 
is currently being investigated. A number of research- 
ers have documented the normal stages of vocal de- 
velopment observed in hearing children. The stages 
of normal infant vocal development include reflex- 
ive, vegetative, and cry vocalizations from birth to 3 
months of age; precanonical vocalizations, such as 
vowel attempts, emerging around 4 months of age; 


and canonical syllables or true babbling, occurring 
between 7 and 9 months of age in normal-hearing 
children. Canonical babbling is characterized by 
true consonant-vowel repetitions with regular tim- 
ing between the consonant and vowel portions of the 
syllable. The timing elements of canonical babbling 
are believed to form the foundation of speech timing 
in fluent conversational speech. In addition, work by 
Oller et al?+ has documented that the canonical bab- 
bling of children with severe to profound hearing 
losses differs significantly from that of normal-hear- 
ing children in a number of specific ways. First, the 
onset of the babbling is much later than that of nor- 
mal-hearing children, the variety of phonemes used 
is reduced, the volume of babbling is reduced, and, 
most importantly, the timing elements between the 
consonant and the vowel are significantly longer than 
those of normal-hearing children.*4 


Thus, children’s vocalizations studied during co- 
chlear implant candidacy and after implantation may 
provide information on which to judge candidacy for 
implantation and predict outcomes. Specific ques- 
tions addressed by our research program include 1) 
What type and quantity of vocalizations constitute 
“minimal progress” in auditory and speech skills, 
which are requisite to implantation? 2) What is the 
course of infant vocal development after implanta- 
tion? and 3) What is the interaction between vocal 
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development before implantation and the develop- 
ment of speech after implantation? 


METHOD 


Participants. Twelve children who were consid- 
ered candidates for implantation were enrolled in this 
preliminary study. Five of the children underwent im- 
plantation and have been followed for at least 1 year 
after implantation. All of the children received a Nu- 
cleus 24M cochlear implant between 18 and 20 
months of age. None of the children had surgical 
complications that would be indicative of possible 
poor performance with the device. All of the chil- 
dren received aural rehabilitation services through 
their local educational agency in Iowa, which used a 
simultaneous communication approach, as well as 
performing auditory training. 


Protocol. Each child was scheduled to be seen at 
monthly intervals before and after cochlear implan- 
tation, but the actual time between visits varied be- 
tween 1 and 3 months for some of the children. Video 
and digital tape recordings were obtained with the 
child, a parent, and an examiner interacting with fa- 
vorite toys and books, bubbles, and specific toys in 
an elicited speech task. This speech task, developed 
by the first author, was composed of 15 age-appro- 


_priate words sampling the entire vowel quadrilateral 


and age-appropriate consonants. Although it was not 
expected that the children would initially produce 
the words, it was felt that consistent exposure to the 
words and objects would lead to the ability to par- 
ticipate in the task. This task was developed to serve 
the needs of the research project until the children 
were old enough to participate in the current speech 
protocol (sentence imitation and story retell tasks). 


4 months 2 months 2 weeks 5 weeks 


The data were classified into 4 vocalization cate- 
gories: reflexive vocalizations, precanonical babbling, 
canonical babbling, and words. Interobserver agree- 
ment between 2 coders on 20% of the data was ob- 
tained. Ten-minute segments of data were coded in- 
dependently by 2 transcribers. The types of vocali- 
zations were identified in an ongoing time sequence, 
and interobserver agreement was calculated by a 
point-by-point comparison of each instance of vocal 
behavior. For the reliability measures completed, in- 
terobserver agreement averaged 90% for the identi- 
fication of reflexive vocalizations, 87% for the iden- 
tification of precanonical vocalizations, and 87% for 
canonical babbling. A portion of the segments iden- 
tified as canonical babbling were analyzed acoustical- 
ly to verify the presence of canonical babbling ac- 
cording to the criteria of Oller et al.3+ Specifically, 
the timing of the second formant frequency transition 
between consonant release and the nucleus of the vow- 
el could not exceed 120 ms for the segment to be con- 
sidered normal canonical babbling. 


RESULTS 


For the 12 children who have developed canoni- 
cal babbling, the average age of onset was 6.5 months 
after implantation. Four of the 5 children who have 
had their implant for more than 1 year began using 
words by 12 months. None of those children showed 
canonical babbling in the candidacy phase. Acoustic 
analysis confirmed that the babbling occurred in nor- 
mal rhythmic intervals and met the timing criteria to 
be considered normal; however, there was a reduced 
variety of phonemes produced and a reduced num- 
ber of canonical babbling occurrences in the sample. 
For the sake of brevity, we discuss here 2 children 
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whose results illustrate interesting trends observed 
in the data. Figures | and 2 illustrate the rate of use 
of the different types of vocal development over time; 
however, the y-axis on the two Figures does not uti- 
lize the same scale. One of the characteristics of in- 
fants is their variability of behavior from one ses- 
sion to the next; therefore, overall trends of develop- 
ment are of interest. One child presented with asym- 
metric hearing loss and underwent implantation in 
his poorer ear. The benefit he received from amplifi- 
cation in the better ear was sufficient for him to devel- 
op canonical babbling before he received a cochlear 
implant. As illustrated in Fig 1, after only 5 weeks of 
implant use, his precanonical vocalization rate in- 
creased, while his canonical babbling rate remained 
relatively constant, over that time period. It appeared 
that he demonstrated continued and uninterrupted de- 
velopment of canonical babbling despite the introduc- 
tion of new auditory input. Additional data will need 
to be gathered and analyzed to understand the inter- 
action between the child’s preimplantation speech 
skills and his development over time. In Fig 2, the 
rate of responses for the 4 vocalization types is shown 
for another child. She presented with a very low rate 
of both reflexive and precanonical vocalizations dur- 
ing the preimplantation phase. After implantation, 
there was an immediate and dramatic rise in preca- 
nonical vocalizations, with canonical syllables emerg- 
ing 4.5 months after implantation and words emerg- 
ing 12 months after implantation. This child is more 
typical of an implant candidate who had virtually no 
residual hearing before implantation. 


DISCUSSION 
The results of this preliminary study suggest that 


7 mo 9 mo 12 mo 


children who are considered as implant candidates 
present with differing stages of vocal development 
during candidacy for the device. It is unclear at this 
time whether canonical babbling during candidacy 
for implantation should or should not be considered 
“minima! progress” in the development of speech and 
auditory skills. It would be premature to suggest that 
a child who had developed the normal timing fea- 
tures of canonical babbling should not be considered 
a candidate for cochlear implantation during infancy. 
Conversely, it also is unclear whether that child would 
be considered an excellent candidate, because the de- 
vice would support the continued development of 
those prerequisite speech skills. Continued study is 
needed to answer this question. Regarding our sec- 
ond question on the course of vocal development, 
we do have evidence to suggest that the implant leads 
to increased precanonical vocalizations, with canoni- 
cal vocalizations emerging an average of 6.5 months 
after implantation. For all of the children, the implant 
allowed them the necessary auditory experience to 
develop normal timing of canonical babbling, an es- 
sential skill in speech development, and 4 of the 5 
children who have more than 1 year of experience 
with the device have developed words by 12 months 
after implantation. The course of vocal development 
was consistent with the stages of development seen 
in normal-hearing children. Regarding the interac- 
tion between vocal development and the implant, the 
male patient described above was able to transfer pre- 
implantation canonical babbling skills to the postim- 
plantation phase. Our data on this child are limited, 
and further research will continue to address this po- 
tential interaction effect. We do not yet know whether 
these skills will lead to even faster or more normal 
speech acquisition over time. 
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The Nucleus 24 Contour is a new cochlear implant that has recently undergone clinical trials in adults and children. The Contour 
uses the same electronics as the previous-generation Nucleus 24 (CI24M) but incorporates a downsized receiver-stimulator and a 
perimodiolar electrode array. The indications for use were expanded to include children as young as 12 months of age and children 
24 months of age and older with severe to profound hearing loss who had open-set word recognition up to 30%. The Contour was 
successfully implanted in 256 children. The mean level of postoperative speech perception with the Contour was significantly better 
than the preoperative baseline with hearing aids on all measures. Children who had open-set speech perception in an audition-only 
condition before surgery demonstrated higher levels of postoperative performance with the Contour than children with no open-set 


speech perception before surgery. 


KEY WORDS — ACE (advanced combination encoders) strategy, cognitive ability, linguistic ability, maximum comfort level, 


perimodiolar electrode array, spectral peak strategy, threshold level. 


INTRODUCTION 


A clinical trial to establish the safety and effec- 
tiveness of the Nucleus 24 Contour in adults and chil- 
dren was initiated in February 2000. The Contour is 
electrically equivalent to the commercially released 
Nucleus 24, but has a downsized receiver-stimulator 
package and a precurved electrode array that is de- 
signed to be inserted into the cochlea with minimal 
trauma, in proximity to the perimodiolar wall (Fig 
1). The Nucleus 24 Contour was specifically designed 
to accommodate the small skull size of very young 
children with less mastoid drilling than conventional 
cochlear implants. It has a very low-profile, angled 
shape to lie flat against the head of children as young 
as | year of age. The perimodiolar electrode array 
preserves important attributes of the 22-electrode 
banded array while providing additional functional 
advantages. Like the Nucleus 24 straight array, the 
Nucleus 24 Contour has 22 pure platinum contacts 
that span the speech frequency range within the co- 
chlea. In contrast to the straight array, the electrode 
contacts of the Contour are half-bands oriented in 
the direction of the target auditory neurons and spi- 
ral ganglion cells. 

The Contour electrode array is manufactured 
curved and is straightened for insertion by a malle- 
able platinum stylet that is housed within a lumen 
along the entire length of the array. During inser- 
tion, the smooth silicone rubber surface of the array 
glides along the outer cochlear wall. After the elec- 
trode is fully inserted, the stylet is withdrawn, allow- 


ing the electrode to relax to its resting shape, which 
matches the mean modiolar spiral of the human co- 
chlea. The Contour electrode array occupies a rela- 
tively small proportion of the scala tympani volume, 
does not require the use of invasive bands or position- 
ers to achieve perimodiolar placement, and does not 
apply static pressure to delicate cochlear tissues. 


There are several potential advantages to perimo- 
diolar electrode placement. Orienting the stimulat- 
ing electrodes toward and in close proximity to the 
target auditory neurons reduces the electrical current 
required to elicit auditory percepts. Reducing cur- 
rent requirements improves the power efficiency of 
the system, potentially improving battery life and al- 
lowing the development of smaller speech proces- 
sors that run at faster stimulation rates. In addition, 
lower, more focused current may improve spatial se- 
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Fig 1. Nucleus 24 Contour cochlear implant. 
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TABLE 1. DEMOGRAPHICS FOR THREE 
AGE-DETERMINED. SUBGROUPS 


25 mo to 
12 to24mo 4yll mo 25 y 











Age at implantation 


(y; mean + SD) 1545.1 34410 9.743.6 
Duration of hearing Joss* 

(y; mean + SD) 13413 28413 6644.5 
Cause of hearing loss 

Unknown 61.7% 79.4% 57.7% 

Familial ` 21.1% 11.8% 12.8% 

Meningitis 8.3% 1.5% 7.7% 

Mondini deformity 3.3% 5.1% 

Viral 1.7% 1.5% 3.8% 

Ototoxicity 3.8% 

Other 3.3% 5.9% 9.0% 


*Severe to profound sensorineural hearing loss in implanted ear. 





lectivity and reduce channel interaction, thus poten- 
tially improving speech perception outcomes. 


METHODS 


Subject Selection. With progressive improvements 
in cochlear implant technology and concomitant re- 
cipient outcomes, the selection criteria for clinical 
trials have broadened. The specific design elements 
of the Nucleus 24 Contour, including its smaller size 
and robust titanium case, made it appropriate for trial 
use in children as young as 12 months of age — 
younger than in previous implant trials. In addition, 
higher levels of preoperative speech perception were 
allowed on the basis of the expected level of postop- 
erative outcomes. The selection criteria for children 
younger than 2 years of age were more conservative 
than those for older children. The criteria for infants 
12 to 24 months of age were bilateral profound sen- 
sorineural hearing loss and a plateau in auditory skill 
development. The criteria for children 25 months to 
17 years of age were severe to profound bilateral sen- 
sorineural hearing loss, a plateau in auditory skill ‘de- 
velopment (given a history of appropriate interven- 
tion), and a Multisyllabic Lexical Neighborhood Test 
(MLNT) or Lexical Neighborhood Test (LNT) word 


recognition score of 30% or less in the bẹst-aided 
` condition. 


Study Design. Consistent with previous cochlear 
implant trial designs, the Nucleus 24 Contour was 
evaluated in children at a large number of investiga- 
tional sites (46) by use of a single-subject repeated- 
measures design. Postoperative evaluations in the im- 
plant-alone condition were conducted at 3 and 6 
months after activation and compared with a preop- 


erative baseline conducted in the “‘best-aided” condi- - 


tion. The subjects were tested while using the coding 
strategy used in their everyday lives (advanced com- 
bination encoders [ACE], spectral peak [SPEAK], 


or continuous interleaved sampler [CIS]) and using 
either a body-worn SPrint or an ear-level ESPrit 
speech processor. Because the optimal coding strat- 
egy can be difficult to assess in a very young child, 
the default recommendation was to use the ACE strat- 
egy at a moderate stimulation rate (eg, 720 or 900 
Hz per channel). This recommendation was based 
on prior comparative data from adults that demon- 
strated a strong preference for ACE and significantly 
better sentence recognition in noise with ACE than 
with SPEAK or CIS.! 


The primary dependent measures varied for each 
of the age subgroups, reflecting the children’s de- 
veloping cognitive and linguistic competence. A pa- 
rental rating scale of listening behaviors (Infant Tod- 
dler Meaningful Auditory Integration Scale [IT- 
MAIS?) was used to assess progress in infants be- 
tween 12 and 24 months of age. For children between 
25 months and 4 years of age, the low verbal version 
of the Early Speech Perception (ESP) battery,> the 
Glendonald Auditory Screening Procedure* (GASP), 
the MLNT,° and the Meaningful Auditory Integra- 
tion Scale (MAIS)® were administered. The MLNT 
was administered recorded, and the GASP and ESP 
were administered using monitored live voice. The 
GASP was administered in open-set format, and all 
measures were presented at 70 dB sound pressure 
level. 


Children 5 years of age and older were adminis- 
tered the standard version of the ESP, the GASP, the 
LNT (monosyllabic words), the Hearing In Noise Test 
for Children (HINT-C),’ and the MAIS. Al test stim- 
uli for the older children were presented with recorded 
materials. 


Subjects. The Nucleus 24 Contour had the short- 
est clinical trial to date of any cochlear implant that 
has been approved for children by the US Food and 
Drug Administration. Between February 2000, when 
the Nucleus 24 Contour clinical trial was initiated, 
and November 2000, when it was approved for com- 
mercial distribution, 256 children underwent implan- 
tation. In contrast to previous cochlear implant tri- 
als, the preponderance of children enrolled in this 
study were very young. Eighty-three subjects (32.4%) 
were 24 months of age or younger, and 48 of these 
(18.8% of total) were 18 months of age or younger. 
Seventy-five children (29.3%) were between 25 
months and 4 years of age, and 98 children aCe: 3%) 
were 5 years of age or older. 


The demographics for the 3 subgroups of children 
are presented in Table 1. The majority of subjects 
were either born deaf or acquired severe to profound 


_ hearing loss before the acquisition of language. The 


subset of children for whom postoperative data are 
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TABLE 2. RESIDUAL HEARING BY FREQUENCY AT 
ONE MONTH AFTER TUNE-UP 








Frequency (Hz) 
250 500 1,000 2,000 4,000 6,000 








No. of children with 
measurable hearing 
before surgery 

No. of children with 
measurable hearing 
after surgery 26 24 15 12 6 6 


Data are for children from 12 months to 17 years 11 months of age 
(n = 54). 


47 46 43 31 21 21 





presented is relatively small because of the short du- 
ration of the trial and the large number of very young 
children whose limited linguistic and cognitive skills 
precluded their ability to take standardized tests. Data 
are presented for all children for whom postopera- 
tive scores were received for the 3-month (n = 162) 
and/or 6-month (n = 71) postactivation intervals. 


RESULTS 


Psychophysics. Two of the design goals for the 
Contour perimodiolar electrode array were 1) to pro- 
vide simple electrode insertion with minimal intra- 
cochlear trauma and 2) to reduce the level of current 
required to provide auditory stimulation. Evidence 
supporting the achievement of the first goal includes 
surgical outcomes and assessment of residual hear- 
ing preservation. With the exception of 2 children 
with postmeningitic cochlear ossification, all Con- 
tour electrodes were successfully inserted to their full 
design specification. A survey of participating sur- 
geons indicated that 86% of respondents (32 of 37) 
reported that the Contour electrode array was easier 
to insert than the previous Nucleus straight array. In 
addition, 69% of respondents (20 of 29) reported that 
the Contour was easier to insert than other cochlear 
implant electrodes, and 97% (28 of 29) preferred the 


-E CI24M C-Level 
-Contour C-Level 
—@-CI24M T-Level 
-O-Contour T-Level | 


Current Level 





123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 
Electrode Number 
Fig 2. Mean threshold levels (T levels) and maximum 


comfort levels (C levels) for Nucleus 24 (n = 40) and 
Nucleus Contour (n = 56) in adults. 
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Fig 3. Mean Infant Toddler Meaningful Auditory Inte- 

gration Scale (IT-MAIS) scores by question for infants 


(12 to 24 months) before surgery and at 1-month, 3- 
month, and 6-month postactivation intervals. 





Always 


Frequently 


Occasionally 


Rarely 





Never 


downsized receiver-stimulator package to the larger 
Nucleus 24, particularly for use in infants. 


For further assessment of the safety of the design, 
acoustic audiograms were obtained in the implanted 
ears before surgery and after 1 month of device use. 
Postoperative audiometric data were reported on 82 
subjects. Fifty-four of the 82 subjects (65.8%) had 
measurable hearing at 1 or more frequencies in the 
implanted ear before surgery. Fifty-four percent of 
these children (29 of 54) had measurable hearing on 
at least 1 frequency after 1 month of device use (Table 
2). Although the postoperative thresholds were uni- 
versally reduced from their preoperative levels, the 
preservation of at least some hearing is an indication 
that the Contour Electrode produces minimal intraco- 
chlear trauma during insertion. 


The reduction in current requirements is demon- 
strated by data collected in the companion Contour 
trial in adults. Figure 2 demonstrates the mean thresh- 
old levels (T levels) and maximum comfort levels 
(C levels) for 40 adult subjects with the Contour peri- 
modiolar electrode, compared with 56 adult subjects 
with the Nucleus 24 straight electrode array. All sub- 
jects were programmed with the ACE coding strat- 
egy at 900 Hz per channel. The mean (+SD) C levels 
were reduced from 196.9 + 20.4 current levels for 
the straight Nucleus array to 178.4 + 20.1 current 
levels for the Contour. Similarly, the mean T current 
levels were reduced from 152.3 + 20.2 for the straight 
array to 139.9 + 19.9 for the Contour. The changes 
observed for C levels were statistically significant 
across the electrode array (p < .05). The changes ob- 
served for T levels were also significant for electrodes 
7 through 22 (p < .05). 


Speech Perception. Figure 3 shows the mean scores 
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Fig 4. Preoperative and postoperative mean MAIS and 
closed-set speech perception scores for children 25 months 
to 4 years of age. 


on the 10 questions of the IT-MAIS for 60 infants 
younger than 2 years of age at up to 4 test intervals. 
The IT-MAIS allows parents to rate their child’s spon- 
taneous listening behaviors on tasks such as alerting 
to sound, discrimination of 2 speakers, and associat- 
ing vocal tone (eg, anger, excitement, fear) with its 
meaning on the basis of hearing only. Before sur- 
gery, the infants exhibited (on average) the 10 lis- 
tening behaviors quantified by the test either “nev- 
er” or “rarely” when they were using hearing aids. 
After only 3 months of implant use, the mean scores 
ranged between “occasionally” and “frequently” for 
6 of the 10 listening behaviors and between “fre- 
quently” and “always” on 3 of 10 listening behav- 
iors. After 6 months of Contour use, the infants mani- 
fested 6 of the 10 listening behaviors either “fre- 
quently” or “always” on average. Although it is not 
a formal speech perception measure, the IT-MAIS 
provides a good metric for judging early changes in 


C] Preoperative 
100 E 3 Months Postactivation 
HE 6 Months Postactivation 
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N= 76 4524 7445 24 7746 25 78 43 24 78 43 24 76 3721 


Spondees Mono GASP 


Fig 5. Preoperative and postoperative mean speech per- 
ception scores for children 5 to 17 years of age. See text 
for abbreviations. 





GEE Preoperative 
EE 3 Months Postactivation 
HEE 6 Months Postactivation 


LNT Words* 
Speech Measure 


LNT Words** 


Fig 6. Postactivation open-set word recognition scores 
at 3 and 6 months as function of preoperative residual 
hearing. * — Scored greater than chance (0%) before 
surgery on Lexical Neighborhood Test (LNT) word test 
(before surgery and 3 months after device activation; n= 
25; 6 months after activation, n = 13). ** — Scored 0% 
before surgery on LNT word test (before surgery and 3 
months after activation, n = 18; and 6 months after acti- 
vation, n = 9). 


basic developmental listening behaviors, especially 
in very young children, who have limited attending 
and linguistic skills and for whom we have very few 
appropriate test measures. 


Figure 4 shows the mean performance over time 
on the MAIS, ESP Pattern Perception, ESP Spondee 
Identification, and ESP Monosyllable Identification 
for young children between 25 months and 4 years 
of age. Although the open-set GASP and MLNT mea- 
sures were also administered, because of the prepon- 
derance of 2-year-olds in this group, relatively few 
of the children had the linguistic or cognitive abili- 
ties to take these difficult tests. The mean total score 
onthe MAIS improved from 11 (27.5%) to 27 (67.5%) 
correct out of 40 possible points after 6 months of 
Contour use. On the closed-set ESP test, the mean 
preoperative scores were around chance performance 
for the 3 subtests (pattern, spondee, and monosyl- 
labic word identification). The mean performance im- 
proved significantly on these tests after limited ex- 
perience with the Contour. The mean scores improved 
from 30.5% + 26.8% to 67.0% + 39.2% on pattern 
perception, from 29.0% + 21.7% to 73.2% + 39.6% 
on spondee identification, and from 27.4% + 18.2% 
to 62.8% + 40.0% on monosyllable identification 
after 3 months of device use. The similarity of the 
mean scores at the 3- and 6-month postactivation in- 
tervals may reflect the small number of subjects in 
this young age group who had achieved the longer 
evaluation interval at the time of this report. 


Figure 5 summarizes the mean open-set word and 
sentence recognition for the oldest children (=5 years) 
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Fig 7. Comparison of pediatric outcomes for Nucleus 24 
Contour (n = 25) and Nucleus 24 (n = 75) after 6 months 
of device use. See text for abbreviations. 


before surgery and at 3 and 6 months after activa- 
tion. As described earlier, selection criteria allowed 
limited preoperative open-set speech perception, ex- 
emplified by means of 6.7% + 9.9% and 11.4% + 
21.3% on the LNT words and HINT-C sentences. 
The mean performance improved significantly on all 
measures after only 3 months of Contour use. The 
mean performance on GASP words improved from 
34.4% + 34.4% to 63.0% + 39.7% at 3 months and 
to 71.5% + 36.0% at 6 months. Similarly, LNT word 
recognition improved from 6.7% + 9.9% to 30.1% + 
29.6% at 3 months and to 41.2% + 33.0% at 6 months. 
Performance on HINT-C sentences also improved 
from 11.4% + 21.3% to 45.8% + 35.0% at 3 months 
and to 60.9% + 37.1% at 6 months. This is remark- 
able progress on difficult open-set measures with 
such limited device experience. 


The inclusion of children with greater levels of 
preoperative speech perception provided the opportu- 
nity to assess the effect of this variable on postopera- 
tive outcomes. The study selection criteria allowed 
children 2 years of age and older to demonstrate se- 
vere to profound hearing loss and up to 30% open- 
set word recognition on either the MLNT (2 to 4 
years) or the LNT (25 years). Figure 6 shows the 3- 
and 6-month postactivation mean scores on the LNT 
for children with no open-set word recognition be- 
fore operation in the implanted ear, compared with 
children with some level of preoperative open-set 
speech perception. The mean LNT score after 3 
months of device use for the children with preopera- 
tive open-set abilities was 44.9% + 28.1%, as com- 
pared with 22.0% + 29.8% for the children with no 
preoperative open-set speech perception. Similarly, 
after 6 months of device use, the mean LNT scores 
were 58.4% + 26.8% and 25.8% + 36.9% for the chil- 
dren with better and poorer preoperative hearing, re- 


spectively. These differences were significant (for 3 
months. Z = —2.199, p = .028; for 6 months, Z = 
—2.08, p = .038) for both evaluation intervals, sup- 
porting the expansion of selection criteria to include 
children with greater levels of preoperative residual 
hearing. 


DISCUSSION 


The pediatric outcomes with the Nucleus 24 Con- 
tour extend the findings of previous cochlear implant 
trials and demonstrate significant improvements in 
auditory skills identified by parental rating and on a 
variety of speech perception measures. Inclusion of 
children as young as 12 months of age provided the 
opportunity to validate the advantages of the down- 
sized receiver-stimulator package and perimodiolar 
electrode array in very young subjects. The design 
revisions incorporated into the Contour were very 
well received by surgical investigators, particularly 
for use in infants. 


Consistent with the more perimodiolar placement 
of the Nucleus 24 Contour electrode array, mean T 
and C levels were lower than those observed with 
the straight array in a group of adult subjects. On 
average, the C levels shifted by 19.3 current levels, 
and T levels by 12.4 current levels. Clinically, this 
may mean improved access to faster rate strategies 
and/or smaller processors for those patients with high- 
er than average current level requirements. 


The pediatric outcomes for the Contour were com- 
pared with the previous-generation Nucleus 24 after 
6 months of device use (Fig 7). The mean postopera- 
tive scores on the ESP Spondee Identification, GASP 
words, and LNT words and phonemes are shown for 
the 2 groups. The mean scores for the Contour sub- 
jects were higher than those for the Nucleus 24 sub- 
jects by 8.6%, 4.3%, 13.7%, and 12.0% for Spondees, 
GASP, LNT words, and LNT phonemes, respectively. 
Because of the high intersubject variance, none of 
these differences was statistically significant. 


The study included children 2 years of age and 
older with severe to profound hearing loss and up to 
30% open-set word recognition on either the MLNT 
(2 to 4 years) or the LNT (25 years). Children who 
scored at greater than chance levels on these word 
tests before operation performed better after opera- 
tion than those who scored at less than chance levels 
before surgery. These data support the expansion of 
selection criteria to include children with greater lev- 
els of preoperative residual hearing. 


CONCLUSIONS 
The safety and effectiveness of the Nucleus 24 
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Contour were demonstrated in a sample of 256 chil- 
dren who underwent implantation at 46 investiga- 
tional sites. The indications for use were expanded 
to include children as young as 12 months of age 
and children 2 years of age and older with severe to 
profound preoperative hearing loss with open-set 
word recognition scores of up to 30%. Investigating 
surgeons reported that the perimodiolar Contour elec- 
trode array was easier to insert than the Nucleus 





straight array and that the downsized receiver-stimu- 
lator was preferred for young children. Postopera- 
tive speech perception was significantly better on all 
measures than was preoperative performance with 
hearing aids. Children with open-set speech percep- 
tion abilities before surgery demonstrated higher lev- 
els of postoperative performance with the Contour 
than children with no open-set speech perception be- 
fore surgery. 
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COCHLEAR IMPLANT CANDIDACY AND PERFORMANCE TRENDS IN 
CHILDREN 
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Data from Clarion cochlear implant pediatric clinical trials were examined retrospectively to uncover trends in candidacy and 
postimplant benefit over time. In particular, age at implantation, educational setting, and communication mode were examined with 
respect to speech perception performance after implantation. The results showed that 1) age at implantation is decreasing, 2) children 
in oral education programs obtain more benefit from a cochlear implant than children in total communication programs, 3) children 
who undergo implantation before 2 years of age show greater benefit than children who undergo implantation between 2 and 3 years 
of age, 4) more younger children are using oral communication than older children, and 5) more children with good auditory skills 
before implantation and more residual hearing are undergoing implantation. In sum, in the 11 years since implants have been avail- 
able to children in the United States, candidacy criteria have evolved and benefit has increased as cochlear implant technology has 


advanced. 
KEY WORDS — child, cochlear implant. 


INTRODUCTION 


Cochlear implants have been available in the 
United States for children (ages 2 to 17 years) since 
1990. During the succeeding decade, candidacy cri- 
teria have changed and benefit has increased as co- 
chlear implant technology has advanced. In this study, 
data from clinical trials of the Clarion cochlear im- 
plant were examined retrospectively to uncover trends 
in candidacy and postimplant benefit in children over 
time. In particular, age at implantation, educational 
setting, and communication mode were examined 
with respect to speech perception performance after 
implantation. 


TRENDS IN CANDIDACY 


The most obvious trend in children is a decrease 
in age at implantation!-3 (also S. Dolan-Ash, unpub- 
lished observations; and this supplement, pp 56-61 
and 66-68). In the most recent clinical trial of the 
Clarion HiFocus electrode (1999 to 2000), 47% of 
the children underwent implantation at or before 3 
years of age. Note, however, that older children still 
underwent implantation in the recent HiFocus group, 
although it was only about one third of the popula- 
tion (Fig 1). 


Another trend is in the communication mode used 
by children at the time of implantation. In the orig- 
inal Clarion pediatric clinical trial (1995 to 1997), 
63% of the children used oral communication and 
37% used total communication. This pattern presum- 
ably reflected the greater acceptance of cochlear im- 
plants by professionals and parents of children in oral 
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educational programs than in programs that incor- 
porated manual (signed English) communication. 
The auditory input provided by an implant was more 
likely to be beneficial to a child enrolled in a pro- 
gram that emphasized development of spoken lan- 
guage. 


Notably, there was an increase in the proportion 
of children using total communication by the time 
of the HiFocus clinical trial (2000). In the HiFocus 
group, approximately half of the children used total 
communication and half used oral communication 
at the time of implantation. That trend suggests that 
because of the increased benefit resulting from im- 
proved technology, children in total communication 
programs, in fact, can benefit from cochlear implants. 
An increase in the acceptance of cochlear implants 
in general also may contribute to the higher number 
of children in total communication programs who 
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Fig 1. Mean age at implantation (months) for children 
who underwent implantation during clinical trial of Clar- 
ion HiFocus Electrode and Positioner (1999 to 2000). 
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Fig 2. Communication mode by age at implantation 
(years) for children who underwent implantation during 
clinical trial of Clarion HiFocus Electrode and Positioner 
(1999 to 2000). TC — total communication. 


received implants. 


In the HiFocus group, there was a relationship be- 
tween communication mode and age at time of im- 
plantation. The majority of children who underwent 
implantation at a very young age (<2 years of age) 
used oral communication (Fig 2). This trend reflects 
the increasing availability of oral education options 
for very young children. The oral education philoso- 
phy, combined with the auditory input provided by 
an implant, has contributed to the increasing benefit 
realized by very young children. In contrast, there is 
a trend for a higher percentage of older children (5 
to 6 years of age) to use total communication at the 
time of implantation. These children did not undergo 
implantation when they were younger because, pre- 
sumably, the notion was not accepted by parents and 
teachers until only recently. 


TRENDS IN PERFORMANCE AFTER 
IMPLANTATION 


Younger Children. In order to examine the effects 
of age at implantation on postimplant benefit in 
young children (<3 years of age), we combined data 
from all clinical trials. A total of 90 children 3 years 
of age or younger had undergone implantation with 
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a Clarion implant. The children were divided into 2 
groups for analysis: children who underwent implan- 
tation between 12 and 23 months of age (n = 59) and 
children who underwent implantation between 24 and 
36 months of age (n = 31). 


Postimplant benefit was determined with the In- 
fant Toddler Meaningful Auditory Integration Scale 
(IT-MAIS).3 The IT-MAIS is a criteria-referenced 
parent interview tool that uses a structured interview 
technique to obtain information about the frequency 
with which a child demonstrates a set of 10 auditory 
or speech behaviors in everyday listening situations. 
The IT-MAIS consists of 10 probes or questions that 
are used to query the parent about the frequency with 
which the child demonstrates target behaviors in ev- 
eryday situations. The target behaviors are develop- 
mentally appropriate for infants and toddlers and in- 
clude behaviors such as searching for the sound 
source, imitation of sound, and responding to the ces- 
sation of sound. These behaviors precede the more 
overt auditory responses to sound seen in older chil- 
dren. On the basis of this information, the examiner 
assigns a rating consistent with the frequency of oc- 
currence of each behavior by using the following 
scoring system: 0, never; 1, rarely; 2, occasionally; 
3, frequently; or 4, always. 


The analyses showed that although the preoper- 
ative IT-MAIS scores were comparable for both 
groups, the score 3 months after implantation was 
significantly higher (p < .01) for the children who 
underwent implantation before 2 years of age than 
the children who underwent implantation between 2 
and 3 years of age. In other words, a difference of as 
little as 1 year in age at implantation had a great im- 
pact on the rate of auditory skill development. At 6 
months, the difference in IT-MAIS scores approached, 
but did not reach, statistical significance (p < .07; 
Fig 3A). Itis not clear whether the smaller difference 
in IT-MAIS scores at 6 months reflects the smaller 
sample size at that interval, or whether the older chil- 
dren are catching up. Future data will indicate the 
direction of this trend. 
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Fig 3. Preoperative and postopera- 
tive (3 and 6 months) Infant Toddler 
Meaningful Auditory Integration 
Scale scores by A) age at implanta- 
tion (* — p<.01; ** —p<.07) and 
B) communication mode (* — sig- 
nificant difference) for children who 
underwent implantation during clini- 
cal trials of Clarion implant. 
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Fig 4. Preoperative and postoperative (3 and 6 months) speech perception performance by communication mode for older 
children who underwent implantation during clinical trials of Clarion implan‘. * — Significant difference. A) Monitored live- 


voice testing. B) Recorded materials testing. 


Additional analyses showed that when both age 
groups were combined, children in oral education 
programs had significantly higher IT-MAIS scores 
than children in total communication programs (p < 
.O1 at 3 months; p < .05 at 6 months; Fig 3B). This 
difference results from the greater emphasis placed 
on auditory skills in oral as compared with total com- 
munication programs. 


Older Children. Performance in older children (>3 
years of age) was examined in the 26 children in the 
HiFocus clinical trial who were older than 3 years of 
age at time of implantation. Their mean age at the 
time of implantation was 9 years, and their mean pure 
tone average thresholds were 106 dB hearing level. 
Forty-two percent of the children were in oral edu- 
cation programs, and 58% were in total communica- 
tion programs. 


Before implantation, the children in oral educa- 
tion programs tended to have better speech percep- 
tion skills at the time of implantation than children 
in total communication programs. This pattern likely 
resulted from the fact that children with good audi- 
tory skills were likely to be placed in oral programs. 
In contrast, children with more limited auditory skills 
were more likely to be placed in total communica- 
tion programs. 


After implantation, the children in oral education 
programs performed significantly better on all mea- 
sures of speech perception than did children in total 
communication programs (Fig 4). That great differ- 
ence likely stemmed from the fact that the children 
in oral education programs had better auditory skills 
in general, or perhaps more residual hearing, than the 
children in total communication programs. Interest- 
ingly, the children with good preoperative auditory 
skills probably would not have been considered for 
implantation several years ago. However, their excep- 
tional speech perception skills after implantation 


show that they make remarkable use of the auditory 
input provided by the cochlear implant. 


CONCLUSIONS 


Taken together, the Clarion clinical trial data show 
the following trends in cochlear implant candidacy 
and benefit in children. 


l. Age at implantation is decreasing. This trend 
reflects improved identification of profound hearing 
loss in very young children, increased public aware- 
ness of cochlear implants, and the high level of bene- 
fit provided by contemporary cochlear implant tech- 
nology. 


2. Children in oral education programs obtain 
more berefit from a cochlear implant than children 
in total communication programs — even children 
younger than 3 years of age. This trend indicates that 
oral education programs more effectively emphasize 
use of the auditory information provided by an im- 
plant than do total communication programs. 


3. Children who undergo implantation before 2 
years of age show greater benefit than children who 
undergo mplantation between 2 and 3 years of age. 
(This general conclusion may be confounded by com- 
munication mode.) This trend suggests that earlier 
implantation facilitates closer-to-normal rates of oral 
language acquisition. 


4. A greater proportion of younger children than 
older children are using oral communication. This 
trend ind cates that more oral education options have 
become available for very young children. The his- 
torical increase in benefit provided by cochlear im- 
plants most likely precipitated the need for more oral 
language—based programs. 


5. Children with good auditory skills and more 
residual hearing before implantation are undergoing 
implantation more today than in the past. These chil- 





Osberger et al, Trends in Implanted Children 65 
dren may not have been considered implant candi- technology, these children now can realize benefit 
dates in the past because of their “good” hearing. from a cochlear implant that surpasses the benefit 
However, because of advances in cochlear implant they could receive from conventional hearing aids. 
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PEDIATRIC PERFORMANCE WITH THE MED-EL COMBI 40+ 
COCHLEAR IMPLANT SYSTEM 


DARLA C. FRANZ, MA 
DURHAM, NORTH CAROLINA 


Presented here are the results of speech perception measures and subjective scales from 68 children with 6 months of device 
experience and from 49 children with 12 months of experience in using the Med-El Combi 40+ cochlear implant system. All children 
demonstrated bilateral profound hearing loss before operation, and 90% experienced onset before 3 years of age, but their hearing 
losses had a broad range of causes. A battery of age-appropriate objective speech perception measures was administered, in addition 
to a battery of subjective auditory skill assessments. The group data indicate that the children showed a statistically significant 
improvement in a wide array of speech perception skills and auditory behaviors with this device. In addition, subjective therapist 
survey data indicated changes in primary receptive and expressive communication method with Combi 40+ device use. Individually, 
all children showed significant improvement on at least 1 of the speech perception measures used in the test battery. This evidence 
Suggests that the Med-El Combi 40+ system can provide meaningful auditory information for profoundly deaf children, which in 


turn supports meaningful functional improvement in speech perception and communicative skill development. 


KEY WORDS — cochlear implant, Med-El Combi 40+, pediatrics, speech perception. 


INTRODUCTION 


The Med-E] Combi 40+ cochlear implant system 
is the most recent implant system to become available 
in the United States. Globally, it is available in 52 
countries and is supported by 12 subsidiary offices 
worldwide. The current version of the Combi 40+ 
device has been in use in other countries since 1996. 
It features a thin (<4 mm) package profile, deep (31 
mm) electrode insertion, and 12 channels of mono- 
polar stimulation at an overall rate of 18,180 pulses 
per second (Fig 1). An ear-level speech processor, 
the Tempo+, is available with a wide variety of wear- 
ing options (Fig 2). A body-worn processor, the Cis- 
Pro+, is also available. The device has been described 
in detail elsewhere in the literature. !-3 


The Combi 40+ cochlear implant system received 
approval from the US Food and Drug Administration 
(FDA) in August 2001. The Combi 40+ clinical trial 
for the adult population began in the United States 
in November 1997. The adult study was expanded 
to include the pediatric population in April 1998. This 
report presents the results of 68 children with 6 
months of experience, and of 49 of those children 
with 12 months of experience, in using the standard 
Combi 40+ electrode array and either the CisPro+ or 
Tempo+ speech processors. 


MATERIALS AND METHOD 


This work reflects the cooperation of 37 clinical 
sites in the United States who have participated in 
FDA clinical trials for the device. Device efficacy 


was evaluated with a single-subject repeated-mea- 
sures design, with the subject serving as his or her 
own control. Tests administered to each age group 
are described in the Table. A variety of objective mea- 
sures were coupled with subjective parent and thera- 
pist report scales to attempt to quantify gains seen 
both in the test setting and in daily life. The Commu- 
nicative Skills Checklist is a new measure that has 
not been reported in the literature to date and will 
not be reported in detail here. 


In addition, rehabilitative providers were also asked 
to categorize the child’s expressive and receptive com- 
munication method at each postoperative follow-up 


AGE STRATIFICATION OF STUDY POPULATION AND 
MEASURES ADMINISTERED TO EACH SUBGROUP 


Mean Age 
at Time of 
N Implantation (y) 








Measures 


ESP Low Verbal 

GASP words 

MLNT 

IT-MAIS 
Communicative Skills 
Communication method 
ESP Standard 

GASP words 

MLNT 

LNT 

BKB Sentences 

MAIS 

Communicative Skills 
Communication method 


Test Group 
Younger (<5 y) 48 2.89 





Older (>5 y) 39 9.36 





The author is employed by Med-El Corporation, Durham, North Carolina. 
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Fig 1. Combi 40+ cochlear implant. 


point. This was defined as the primary method that 
adults used to communicate with the child, and the 
primary method the child used to communicate with 
others. The discrete choices were verbal communica- 
tion, cued speech, total communication, and total 
communication with sign language emphasis. 


The subjects for this study were bilaterally pro- 
foundly deaf children 18 months to 17 years 11 months 
of age. The mean age at implantation for the popula- 
tion as a whole was 5.8 years, and the mean duration 
of profound hearing loss was 4.4 years. The study 
population was stratified into 2 age groups, and tests 
were administered that most appropriately reflected 
the developmental abilities of each age group (see 
Table). Seventy-nine of the children (90%) were noted 
to have experienced onset of profound deafness be- 
fore 3 years of age, so this can be considered largely 
a prelinguistically deafened group. Only 6 children 
were noted to have a clearly progressive hearing loss. 
A wide range of etiologies was represented, with “un- 
known origin” being the most common. 


RESULTS 
Statistical analysis was performed on the sample 
with a paired t-test comparing baseline data to the 6- 
month data point (n = 68) and comparing baseline 
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Fig 2. Tempo+ ear-level speech processor. Processor has 
multiple wearing options. In this view, processor is cou- 
pled to angled battery pack, which has direct input port 
for connection to assistive listening devices. 
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Fig 3. Group mean data on objective measures compares 
both 6- and 12-month data points to baseline. Low Verbal 
(LV) Early Speech Perception (ESP) test was adminis- 
tered to young children only (n = 48). Glendonald Audi- 
tory Screening Procedure (GASP) was administered to 
all children (n = 68). All other measures were adminis- 
tered to older children (n = 39). Population consisted of 
68 children with preoperative data, 68 children at 6 
months, and 49 children at 12 months. 


data to the 12-month data point (n = 49). If the data 
were not normally distributed, a Wilcoxon matched- 
pairs signed-rank test was used in place of a paired 
t-test. No children were excluded from the analysis 
on the basis of poor performance on the test measures. 
Testing was performed before operation and 3, 6, and 
12 months after operation. The data from 6 and 12 
months are presented here. All children experienced 
an increase in at least | test score in the battery. 


Objective Measures. The group mean performance 
improvement at the both the 6- and 12-month postop- 
erative intervals was statistically significant (p < .001 
at 12 months) on all of the objective measures in- 
cluded in the study. Mean values are presented on each 
measure (Fig 3). Both the Low Verbal and Standard 
versions of the Early Speech Perception (ESP) test* 
have 3 subtests: pattern perception, spondee identifi- 
cation, and monosyllable identification, each progres- 
sing in difficulty over the first. The Glendonald Audi- 
tory Screening Procedure (GASP) words? are scored 
by the number of words correctly perceived. Both the 
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Fig 4. Results of Meaningful Auditory Integration Scale 
administered to older group. 0 — never; | 
— occasionally; 3 — frequently; 4 — always 


rarely; 2 
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Fig 5. Primary communication method. Therapists were 
asked to categorize primary communication method used, 
both receptively and expressively, over course of study. 
Discrete choices were verbal communication, tota! com- 
munication (TC), total communication with emphasis on 
signed or manual language, and cued speech. 


Multisyllabic Lexical Neighborhood Test (MLNT) 
and the Lexical Neighborhood Test (LNT)® are scored 
for correct word and phoneme perception. Bamford- 
Kowal-Bench sentences’ are scored for correct word 
identification. 

Subjective Measures. Meaningful changes in be- 
haviors related to device bonding, spontaneous alert- 
ing to sound, and deriving meaning from sound were 
seen on subjective measures such as the Meaningful 
Auditory Integration Scale (MAIS)® and the Infant 
Toddler Meaningful Auditory Integration Scale (IT- 
MAIS)? over the course of the study. The MAIS re- 
sults for the older children are shown in Fig 4. After 
6 months of device use, improvement was seen on 
all probe questions aside from probe 2. Probe 2 as- 
sesses whether the child appears to be upset if his or 
her device is not functioning. Before operation, the 
average response to this probe indicated that children 
already “frequently” appeared upset if their hearing 
aids were not functioning. This finding most likely 
reflects the level at which the children in the study 
had already bonded with their hearing device and 
relied on even limited auditory input before receiving 
an implant. Improvement was seen on probe 2 at the 


12-month interval. In fact, after 12 months of device 
use, the average response had increased to “always” 
on more than half of the probe questions. The mean 
total MAIS score improved from 20 before operation 
to 31 and 35 after 6 and 12 months of use, respectively 
(40 possible points). 


Communication Method. Changes in primary com- 
munication method are shown in Fig 5. This graph 
represents the percentage of the total sample of chil- 
dren who used a given communication method as their 
primary means of expressive and receptive commu- 
nication. In other words, rehabilitative professionals 
were asked the questions “How does this child usual- 
ly communicate with others?” and “How do others 
usually communicate with him or her?” It was not 
required that a child use any given method profici- 
ently; rather, this was an attempt to quantify the real 
changes in communication abilities in daily life over 
time. An absolute increase in the number of children 
who relied on verbal communication for both expres- 
sive and receptive communication was seen over the 
course ofthe study. Consequently, the number of chil- 
dren who were reported to rely on total communi- 
cation or total communication with a manual empha- 
sis for either expressive or receptive communication 
decreased over the period of the study. 


DISCUSSION 


These group data indicate that children show a sta- 
tistically significant improvement in a wide array of 
speech perception skills and a meaningful increase 
in auditory behaviors with this device. In addition, 
subjective therapist reports indicated an increase in 
the proportion of children who used verbal communi- 
cation as their primary receptive and expressive meth- 
od after Combi 40+ device use. This evidence sug- 
gests that the Med-El Combi 40+ system can pro- 
vide beneficial auditory stimulation for profoundly 
deaf children, which in turn supports meaningful im- 
provement in speech perception and an increase in 
the use of spoken communication. 
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EFFECTS OF AGE AT IMPLANTATION IN YOUNG CHILDREN 
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This study examined the effects of age at implantation on the development of communication abilities in children with early 
implantation. The 73 participants were prelingually deafened, received a cochlear implant before 5 years of age, and used current 
cochlear implant technology. The children were administered a battery of speech and language outcome measures before implanta- 
tion and again at successive 6-month postimplant intervals. A mixed model analysis was used to examine the rate of growth in word 
recognition and language skills as a function of age at time of implantation. The results revealed significant improvements in com- 
munication skills over time. Spoken word recognition improved at a faster rate in the oral children with early implantation. However, 
the children who underwent implantation before 3 years of age had significantly faster rates of language development than did the 
children with later implantation. The oral children demonstrated more rapid gains in communication abilities than did the children 


who used total communication. 


KEY WORDS — child, cochlear implant, deafness, hearing impairment, language development, speech perception. 


The first pediatric cochlear implant recipient un- 
derwent implantation by Dr William House approxi- 
mately 20 years ago. Over the past 2 decades, clini- 
cians and scientists have seen tremendous techno- 
logical advances in cochlear implant design. The cur- 
rent multichannel cochlear implant systems offer an 
array of electrode and speech processing options and 
provide substantial levels of open-set word recogni- 
tion to the majority of children who receive them.!+ 
The guidelines for determining pediatric cochlear im- 
plant candidacy also have evolved. Recently, the US 
Food and Drug Administration modified candidacy 
guidelines to permit implantation in children with pro- 
found deafness as young as 12 months of age; im- 
plantation of patients with severe-to-profound hear- 
ing loss is permitted in children 2 years of age and 
older. 


With the evolution in cochlear implant candidacy, 
more children than ever before have the potential to 
develop spoken word recognition and language pro- 
cessing using auditory input from a cochlear implant. 
However, great variability exists in pediatric cochlear 
implant outcomes.>~7 Identifying factors that contrib- 
ute to this variability may have an important impact 
on determining candidacy and designing appropri- 
ate intervention strategies for this population. 


One factor that has been shown to influence the 
development of communication abilities in children 


with cochlear implants is their age at the time of im- 
plantation.*8 Previous work in our laboratory and 
others demonstrated that children who undergo im- 
plantation before 5 years of age achieve significantly 
better communication outcomes than children who 
undergo implantation after that age.5%10 Other re- 
search suggests that even earlier implantation is bene- 
ficial.!1-14 For example, Waltzman and Cohen!3 dem- 
onstrated that the speech perception abilities of 11 
children who underwent implantation before 2 years 
of age increased significantly from before implanta- 
tion to their latest interval after implantation. 


The purpose of this study was to examine the ef- 
fects of age at implantation on the rate of communi- 
cation development in a large number of children with 
early implantation. Because communication mode 
also has been shown to influence pediatric cochlear 
implantation outcomes,!5-16 the effects of age at im- 
plantation were examined separately for children who 
used different communication modes. 


METHODS 
PARTICIPANTS 


The participants in this investigation were a sub- 
set of the children followed longitudinally at Indiana 
University School of Medicine in studies of pediat- 
ric cochlear implant outcomes. All children in our 
longitudinal study were prelingually deafened and 
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TABLE 1. DEMOGRAPHIC CHARACTERISTICS 





_ TABLE 2. DEVICE CHARACTERISTICS 








Total 
Communication 
Oral Group Group 
CIl<3 Cl>3 Cl<3 CI>3 
(N = 25) (N = 17) (N = 16) (N = 16) 





Total 
Communication 
Oral Group Group 
Cl<3 CI>3 Cl<3 CI>3 
(N = 25) (N = 17) (N = 16) (N = 16) 





Mean age at onset of 


deafness (y) 0.18 0.29 0.18 0.08 
Mean age at implant 

fitting (y) 2.13 3.89 2.24 3.90 
Mean unaided pure 

tone average (dB HL) 109 108 111 109 


CI<3 — received cochlear implant before 3 years of age; CI>3 — 
received cochlear implant between 3 years and 4 years 11 months of 
age; HL — hearing level. 





have a severe to profound hearing loss with no addi- 
tional handicapping conditions. All participants in 
the longitudinal study who met 3 additional criteria 
were included in the present investigation. Two of 
the additional criteria were implantation before 5 
years of age and use of current-generation cochlear 
implant technology and processing strategies. The 
third criterion was that the children had to have been 
administered our Preschool Assessment Battery!” be- 
fore implantation and at intervals after implantation. 
Seventy-three children met the criteria for participa- 
tion. The children first were separated into 2 groups 
based on their communication method (oral commu- 
nication versus total communication, or the combined 
use of signed and spoken English). Within each com- 
munication group, the children were further stratified 
into 2 groups by their age at the time of implanta- 
tion. The CI<3 groups received a cochlear implant 
before 3 years of age, and the CI>3 groups received 
a cochlear implant between 3 years and 4 years 11 
months of age. Table | presents the average demo- 
graphic characteristics of the oral and total commu- 
nication groups separately as a function of age at im- 
plantation. Their device characteristics are presented 
in Table 2. 

STIMULUS MATERIALS AND PROCEDURES 


The children in our longitudinal studies are admin- 
istered a battery of speech and language tests before 
implantation and every 6 months after implantation 
for at least 3 years. The children then are tested an- 
nually for as long as they continue in the study. Data 
from 2 measures in our battery were analyzed for 
the present investigation. 


Mr Potato Head Task. The Mr Potato Head task is 
an open-set measure of spoken word recognition. Mr 
Potato Head consists of a plastic potato “body” with 
approximately 20 body parts and accessories. The 
children are asked to manipulate the toys in response 
to 10 verbal commands presented in the auditory- 
only mode, eg, “Give him some green shoes.” The 


Nucleus CI — Speak 


processing strategy 15 12 11 12 
Nucleus CI — ACE 

processing strategy 5 l 4 4 
Clarion CI — CIS 

processing strategy 5 4 l 0 


Speak — spectral peak; ACE — advanced combination encoders; 
CIS — continuous interleaved sampling. 





child’s responses are scored as the percentage of key 
words and sentences correctly identified. For exam- 
ple, if the child responded to the above command by 
touching the green shoes, he or she would get credit 
for a key word, but not for a sentence. 


Reynell Developmental Language Scales. This is 
a test of receptive and expressive language devel- 
oped for children with normal hearing. It has been 
used extensively with children with hearing impair- 
ment. The test requires children to comprehend or 
express a hierarchy of language structures ranging 
from object labeling to complex instructions. This 
test is thought to reflect the kinds of communication 
demands children face in daily living. It was admin- 
istered in the child’s preferred mode, either oral com- 
munication or total communication, in this study. The 
test yields receptive and expressive age-equivalency 
scores. In our study, these were then converted to lan- 
guage quotients by dividing the child’s language age 
by his or her chronological age. Language quotients 
less than 1.0 indicate that the language age is less 
than the chronological age. Language quotients great- 
er than 1.0 indicate that the language age is greater 
than the chronological age. 


Data Analysis. Analyses of variance with repeated 
measurements were used for data analysis. In the 
model, subject was considered a random effect; 
length of device use and age at implantation were 
the independent variables. The dependent measures 
were the percentage of words correctly identified on 
the Mr Potato Head Task and the language quotients 
on the Reynell Developmental Language Scales. All 
test scores available for each child were included in 
the analysis. The number of data points varied across 
children for several reasons. First, the children in the 
study had used their cochlear implants for differing 
lengths of time, and therefore, the numbers of times 
they had been tested at 6-month intervals varied as 
well. Second, not every child was available for test- 
ing at every interval. Finally, we were not always 
able to administer all of the tests in our battery to 
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Oral Group 


TC Group 
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Fig 1. Rate of improvement in per- 
centage of words correctly identified 
on Mr Potato Head Task shown as 
function of age at time of implanta- 
tion. CI — cochlear implant, TC — 
total communication. 
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every child, for reasons such as time constraints on 
the family or the child’s attention span. The number 
of children tested and the numbers of observations 
across those children are indicated below. 


RESULTS 


Mr Potato Head Task. Data from 138 observations 
obtained from 40 different oral children were in- 
cluded in this analysis; for the total communication 
group, 129 observations from 32 children were avail- 
able. Figure 1 illustrates the growth in word recogni- 
tion performance as a function of age at time of im- 
plantation and communication group. There was a sig- 
nificant improvement in word recognition over time 
for the oral group (p < .0001). The rapid growth rate 
for the oral children suggests that children will achieve 
maximum scores on the Mr Potato Head Task within 
2 or 3 years after implantation. For the oral children, 
there was also a significant effect of age at the time 
of implantation (p < .0006). Oral children who under- 
went implantation between 3 and 5 years of age dem- 
onstrated a more rapid growth in word recognition 
than did the children who underwent implantation 
before 3 years of age. 


———— 
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Although the children in the total communication 
group demonstrated significant improvement in word 
recognition over time (p < .0001), there was no ef- 
fect of age at implantation on their rate of word rec- 
ognition development. Figure 1 suggests that the 
growth rate of word recognition was much more gra- 
dual for the children who used total communication 
than for the children who used oral communication. 


Reynell Language Development Scales. Perfor- 
mance on the receptive portion of the Reynell test is 
illustrated in Fig 2. For both the oral and total com- 
munication groups, there was a significant increase 
in receptive language skills over time (p < .0001) 
and a significant interaction between age at implan- 
tation and length of cochlear implant use (p < .0006 
for the oral group; p < .035 for the total communica- 
tion group). That is, the receptive language abilities 
of children who underwent implantation before 3 
years of age increased faster than did those of chil- 
dren who underwent implantation between 3 and 5 
years of age. The growth rates of receptive language 
abilities appeared similar for the oral and total com- 
munication groups. 
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Fig 2. Rate of improvement in lan- 
guage quotient scores obtained on 
„receptive portion of Reynell Devel- 
opmental Language Scales as func- 
tion of age at time of implantation. 
Language quotients were derived by 
dividing each child’s receptive lan- 
guage age obtained on Reynell scale 
by child’s chronological age. 
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Fig 3. Rate of improvement in lan- 
guage quotient scores obtained on ex- 
pressive portion of Reynell Develop- 
mental Language Scales as function 
of age at time of implantation. Lan- 
guage quotients were derived by di- 
viding each child’s expressive lan- 
guage age obtained on Reynell scale 
by child’s chronolọgical age. 
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Expressive language abilities also improved sig- 
nificantly over time for children in the oral communi- 
cation group (p < .0001) and the total communica- 
tion group (p < .019), as shown in Fig 3. There was a 
trend for children who underwent implantation be- 
fore 3 years of age to develop expressive language 
skills faster than the children with later implantation 
in both communication groups. However, this trend 
did not reach statistical significance. Finally, Fig 3 
suggests that the total communication children’s de- 
velopment of expressive language abilities was de- 
layed in comparison with that of their oral peers. 


DISCUSSION 


The children in this study, all of whom underwent 
implantation before 5 years of age, demonstrated sig- 
nificant improvements in word recognition and re- 
ceptive and expressive language skills after cochlear 
implantation. When the performance of the children 
was examined as a function of age at time of implan- 
tation (before or after 3 years of age), the effects var- 
ied depending on the test administered. The growth 
in word recognition abilities was significantly faster 
for oral children in the group with an older age at 
implantation. This result may have occurred because 
children 3 years of age and older are more familiar 
with the vocabulary used on the Mr Potato Head Task 
than are younger children.!8 Many of the children 
reached the ceiling of this task within 2 years or so 
of implant use. Many of the children in this study 
now have reached the age and skill level at which 
they can be administered more difficult measures of 
open-set word recognition, such as monosyllabic 
word lists or tests of sentence recognition in noise. It 
remains to be seen whether similar effects of age at 
implantation will be evident on these measures. 


When growth in language skills was measured with 


the Reynell test, the effects of age at implantation 
differed across the receptive and expressive compo- 


Length of Cl Use 


nents of the task. The rate of receptive vocabulary 
development was significantly faster for children who 
underwent implantation before 3 years of age than 
for those who underwent implantation after that age. 
Although the same trend was evident for expressive 
skills, it was not significant for either communica- 
tion group. 


The rates of receptive language development were 
similar for the oral and total communication chil- 
dren. In both groups, children who underwent im- 
plantation before 3 years of age appeared to be ac- 
quiring early receptive language skills on a par with 
those of their normally hearing peers. However, the 
same is not true for the development of expressive 
language skills. Although the oral children appear to 
be acquiring age-appropriate expressive language 
abilities, the total communication children do not. 
Even after 4 years of device use, their growth rate in 
expressive language skills did not approach a lan- 
guage quotient of 1.0 — a finding indicating that their 
expressive language age lagged behind their chrono- 
logical age. The reasons for this discrepancy are not 
clear. It may relate in part to the nature and extent of 
the language to which the children are exposed. 
Whereas oral children with hearing parents are ex- 
posed to spoken communication throughout the day, 
it is often the case that children who use total commu- 
nication have a more limited exposure to language. 
Many caregivers of children who use total commu- 
nication are learning signed English at the same time 
as their child. Furthermore, it is often the case that 
only a limited number of people in the child’s en- 
vironment know or are learning signed English. It 
may be that the linguistic environment of many chil- 
dren who use total communication is impoverished 
in comparison to that of their normal-hearing peers. 
This issue warrants future study. 


The present investigation demonstrates that im- 


ahh, 
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plantation before 3 years of age promotes the devel- 
opment of language abilities in children with pre- 
lingual profound hearing loss. Investigations in our 
laboratory are ongoing to determine whether even 
earlier implantation will be of benefit. With the goal 
of universal detection of hearing loss in infants by 3 
months of age, and appropriate intervention (eg, am- 
plification) by 6 months of age,!92° we expect that 


ever-increasing numbers of very young children will 
be identified as potential implant candidates. We know 
that early intervention (ie, by 6 months of age) and 
early intervention with hearing aids have a signifi- 
cant impact on language development in children with . 
hearing loss,?! but the spoken word recognition and 
language benefits of very early implantation in chil- 
dren with cochlear implants remain to be quantified. 
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Documentation is lacking regarding the ability of congenitally deaf children to attain age-appropriate spoken language skills 
using a cochlear implant, and how early implantation must occur if such development is to be possible. Spoken language data are 
presented for infants who underwent implantation at 18 months of age or younger. Additional data are used to compare outcomes 
among children who differed in age at implantation. Data collected at Carle Clinic and Foundation Hospital were reviewed for 47 
consecutively implanted children ranging in age from 9 to 48 months at implantation. These data were analyzed and compared by 
age-at-implantation groupings for speech perception skills, communication mode, and spoken language abilities. The groups dif- 
fered dramatically in abilities. The best outcomes occurred in children who underwent implantation at or before 18 months of age. 
Several of these infants demonstrated age-appropriate spoken language skills. We conclude that early implantation is desirable. 
Children who undergo implantation as infants may develop language skills commensurate with those of their hearing peers. 


KEY WORDS — cochlear implant, hearing loss, infant, spoken language. 


INTRODUCTION 


The average age at identification of hearing loss 
is declining as more infants are receiving hearing 
screenings shortly after birth. Earlier diagnosis pro- 
vides the opportunity for early cochlear implanta- 
tion in cases of severe-to-profound hearing loss. The 
auditory stimulation provided by a cochlear implant 
in infancy may allow for more normal maturation of 
auditory pathways.!? Evidence suggests that sensi- 
tive periods of neural development exist that are criti- 
cal for spoken language development.>+ If this is so, 
then early cochlear stimulation may provide the au- 
ditory link deaf children need for benefiting from such 
periods. 


Studies have shown that young congenitally deaf 
children who undergo cochlear implantation have the 
ability to learn language at rates comparable to those 
of their hearing peers.*-? Given this capacity, it is 
plausible to surmise that if a child underwent im- 
plantation early, the language deficits that often oc- 
cur when a child has severe-to-profound hearing loss 
could be minimized. Further, if in these instances 
children undergo implantation early enough to also 
benefit from critical periods of neural development, 
it is conceivable that they could develop normal spo- 
ken language abilities. Documentation to substanti- 
ate such outcomes is lacking, as are data indicating 
how early implantation must occur for this to be pos- 
sible. 


Twelve infants 18 months of age or younger have 


undergone implantation at Carle Clinic and Founda- 
tion Hospital. The children vary in duration of co- 
chlear implant use from | month to 4 years. Longi- 
tudinal spoken language data will be presented for 
10 of the 12 children who have at least 6 months of 
implant use. Additional data will be used to com- 
pare the performance of these children with children 
who underwent implantation at slightly older ages 
with respect to word recognition abilities, communi- 
cation mode, and spoken language levels. (Language 
data for some of these children have been previously 
reported.>) 


METHODS AND MATERIALS 


A retrospective analysis was conducted to review 
data compiled on 47 children who underwent co- 
chlear implantation at Carle Clinic and Foundation 
Hospital. The children ranged in age from 9 to 48 
months at the time of implantation. They were as- 
signed to one of four age groups on the basis of the 
age at which they received their cochlear implant: 9 
to 18 months (n = 12), 19 to 30 months (n = 13), 31 
to 40 months (n = 11), or 41 to 48 months (n = 11). 
(The groups had a similar distribution of number of 
subjects, age at identification of hearing loss, and 
cause.) Children who differed greatly from the core 
samples of subjects were not included in the analy- 
sis, eg, children who had significant cognitive de- 
lays in addition to their hearing loss, children who 
had progressive hearing loss, or children who had 
stopped using their device. 





From the Division of Otolaryngology, Expanding Children’s Hearing Opportunities (ECHO) Program, Carle Clinic and Foundation Hospital, 
Urbana, Illinois. This research was partially funded by a grant from the Carle Foundation. 


CORRESPONDENCE — Dianne M. Hammes, MA, Carle Foundation Hospital, 602 W University Ave (ECHO-NCE4), Urbana, IL 61801. 














Y 





Hammes et al, Early Implantation & Spoken Language 75 





Fig 1. Language progress over time for 10 sub- 
jects who underwent implantation at 18 months 
of age or younger. Subjects included are those who 
have at least 6 months of implant use. Dashed di- 
agonal line represents language progress expected 
when child has normal hearing. 


Language Age (in months) 


The children received Cochlear (MSP, Spectra, 
SPrint), Advanced Bionics (1.2, S-Series, Platinum), 
or Med-E] (CisPro+) devices. It was reported that 
they used their implant during all waking hours. 
Length of cochlear implant use ranged from 1 month 
for 1 subject in the infant group to as long as 9 years 
for some of the children in the 2 oldest age groups. 
With the exception of communication mode, data 
were compared for children who had 6 months of 
implant experience or more. 


Spoken language measures included the Skills for 
Hearing Impaired Children (SKI*HI),’ Preschool 
Language Scales—3 (PLS-3),9 Clinical Evaluation of 
Language Fundamentals P or II (CELF-P or CELF- 
TI)!!! and the Oral and Written Language Scales 
(OWLS).!? At the time of testing, language measures 
were chosen on the basis of the child’s age, attention 
span, and overall ability levels. With the exception 
of the SKI*HI, the tests used were ones that were 
standardized, designed for, and normalized on hear- 
ing children. The SKI*HI, the only parental report- 
based measure that was used, was designed for chil- 
dren with hearing loss. Whenever possible, language 
scores were based on standardized measures. The 
Phonetically Balanced—Kindergarten (PBK)!3 word 
lists were used as the comparison measure for open- 
set word recognition abilities among the subjects. All 
tests were administered in accordance with standard 
protocols as described in the respective test manu- 
als. For the group comparison data, speech percep- 
tion scores were reviewed with respect to the best 
performance demonstrated by each child. Spoken lan- 
guage abilities were analyzed in relation to outcomes 
at the most recent test interval. 


RESULTS 


To establish a baseline for the rate of progress and 
performance for the group of children who under- 
went implantation at 9 to 18 months of age, spoken 
language data were reviewed for the children who 
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had at least 6 months of cochlear implant usage (n = 
10). The longitudinal spoken language outcomes for 
these subjects are shown in Fig 1. The graph shows 
the subjects’ chronological age relative to their age 
equivalency scores on connected language measures. 
Generally, the scores were based on parental report 
measures (ie, the SKI*HJ) at the youngest ages, and 
standardized tests normalized on hearing children 
were used as the subjects approached 3 years of age. 
The dashed, diagonal line represents the progress that 
would typically be expected for a child who has nor- 
mal hearing. That is, it shows the line that results 
when a child’s language skills increase by 12 months 
for each year of life. 


As can be seen on the graph by the upward slope 
of the lines, the majority of infants demonstrate a 
language-learning rate similar to that of hearing chil- 
dren, albeit with some evidence of initial delays. Al- 
most half of the subjects (S1, $4, S6, and $18) are 
either on or very near to the dashed line, demonstrat- 
ing that they have closed early gaps to achieve lan- 
guage skills similar to those of their hearing peers. 
In some cases, the deaf subjects (S1 and S6) even 
slightly surpass the expected hearing norm. The 2 
children (S5 and $7) who demonstrate sustained slow- 
er rates of progress are subjects who have mild de- 
velopmental delays in addition to hearing loss. 


Figures 2-4 compare groups of children who un- 
derwent implantation at older ages with the children 
who underwent implantation at ‘18 months of age or 
younger in terms of open-set word recognition, com- 
munication mode, and spoken language achievement. 
The age at implantation for the various groups and 
the numbér of subjects included in each sample are 
indicated on each of the remaining graphs. Because 
the inclusion criteria vary at times (eg, on the basis 
of length of cochlear implant use or a child’s ability 


- to take the test), the n values on various graphs also 


change. 
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Fig 2. Best mean and median word score percentages on 
Phonetically Balanced—Kindergarten (PBK) word lists for 
subjects with 2 or more years of cochlear implant use. 
n — Number of children who could be tested divided by 
number of children in group who have at least 2 years of 
implant experience. 


The PBK was chosen on the basis of the availabil- 
ity of comparative clinical data across the different 
age groups and the test’s ability to assess each child’s 
perception of sounds with minimal need for context 
— an important skill for the development of spoken 
language. Figure 2 shows the word score means and 
medians on the PBK for the subjects who had at least 
2 years of implant experience. There were subjects 
in each age grouping who could not be tested, either 
because of their young age at the time of testing or 
because they lacked the necessary auditory skills to 
take the test. Whereas the infant group (n = 3) 
achieved a word score average of close to 80% cor- 
rect, the children who underwent implantation be- 
tween 19 and 30 months scored close to 60%. The 
children in the 2 later-implantation groups averaged 
closer to 40% correct. 


Communication through the use of spoken lan- 
guage is a common objective of cochlear implanta- 
tion. Figure 3 illustrates the primary mode of com- 
munication used before and after implantation across 
the various age groups. Regardless of age at implan- 
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tation, the majority of children used total communi- 
cation before receiving their implant. After receipt 
of the implant, all of the infant subjects made a suc- 
cessful transition to spoken language. Six children 
who underwent implantation between the ages of 19 
and 30 months were able to rely on spoken commu- 
nication solely, and the other 7 in this group did so 
situationally (eg, when speaking with familiar listen- 
ers or using routine language). In contrast, 3 of the 
children who underwent implantation between 31 and 
40 months of age and 7 of the children who under- 
went implantation between 41 and 48 months of age 
still relied on sign language for communication, even 
though some of these subjects had had their implant 
for as long as 9 years. In the older group, the only 
children who relied exclusively on spoken language 
after implantation were the 2 who had used audi- 
tory-oral communication before implantation. Two 
other subjects in the oldest group had developed the 
ability to use spoken language in some contexts. 


The ultimate goal of implantation is to better en- 
able deaf children to achieve age-appropriate lan- 
guage skills. Figure 4 shows how closely chronologi- 
cal age corresponded to spoken language age for the 
4 age groups. The spoken language achievements 
across the age groupings were compared by 3 differ- 
ent proximity criteria. The comparisons included all 
of the children who had at least 6 months of cochlear 
implant usage. Four of the 10 children who under- 
went implantation between 9 and 18 months of age 
had acquired spoken language skills within 6 months 
of their chronological age. Only | of the 13 children 
who underwent implantation between 19 and 30 
months of age met this standard, and none of the chil- 
dren who underwent implantation after 30 months 
of age achieved this level of performance. By a 12- 
month proximity criterion, 7 of the infants achieved 
the standard, as did 4 of the children who underwent 
implantation between 19 and 30 months of age and 
l child who underwent implantation between 31 and 
40 months. By an 18-month proximity criterion, 9 


atc Fig 3. Percentages of children who used each com- 

munication mode before and after implantation, 
& Comb. by age-at-implantation group. Subjects included 
E Oral have at least 3 months of implant use. TC — total 


communication; Comb. — combination (child can 
rely on spoken language in at least some contexts); 
Oral — child routinely uses spoken language for 
communication. 
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Fig 4. Percentages of children, categorized by age at im- 
plantation, who have acquired spoken language skills 
within 6, 12, or 18 months of their chronological age. Sub- 
jects included have at least 6 months of cochlear implant 
experience. 


of the 10 infants achieved the standard, but the num- 
ber of subjects did not increase among the children 
who underwent implantation after 30 months of age. 
The children who underwent implantation after 40 
months of age were unable to meet any of the proxim- 
ity criteria. 


DISCUSSION 


In our review of the results, several patterns 
emerged. The children who underwent implantation 
at 30 months of age or younger demonstrated better 
open-set word recognition abilities than similar groups 
of children who underwent implantation after 30 
months of age. In addition, the children who under- 
went implantation before 30 months of age were more 
likely to have spoken language skills within 12 
months of their chronological age than were the chil- 
dren who underwent implantation after that age. More 
specifically, the children who had undergone implan- 
tation at 18 months of age or younger were most 
likely to rely solely on spoken language for commu- 
nication. They were also the group with the highest 
number of children who had spoken language skills 
within 6 months of their chronological age. 


Although the spoken language outcomes are com- 
pelling, these data should be interpreted cautiously. 
First, because the children in the youngest group un- 
derwent implantation at 9 to 18 months of age, some 
of them may have met the language standard simply 
because their maximum potential for language gap 
was restricted by their young age at implantation. Sec- 
ond, care must be taken not to overinterpret the re- 
sults with respect to children who underwent implan- 
tation at older ages. Although the children in this sam- 
ple who underwent implantation after 30 months of 


age did not demonstrate the same degree of benefit 
for spoken language development, other benefits were 
noted (eg, development of at least some open-set com- 
munication abilities, speechreading enhancement, and 
sound awareness). 


There were likely a number of factors that contrib- 
uted to the more favorable outcomes demonstrated 
by the children who underwent implantation at 18 
months of age or younger. The most tenable factor 
may have been that early auditory stimulation through 
a cochlear implant contributed to more norma! matu- 
ration of the auditory pathways.? Such stimulation, 
in turn, may have allowed the infants to maximally 
benefit from critical periods of neural development 
that are important for spoken language development. 


Second, not only have criteria for cochlear implan- 
tation candidacy broadened in the past several years, 
but more has been learned about the potential out- 
comes for any child who uses a cochlear implant. 
Expectations after implantation have increased ac- 
cordingly. As such, the lower performance demon- 
strated by longer-term implant users (ie, primarily 
children who underwent implantation between 31 and 
40 or 41 and 48 months) may be reflective not only 
of delayed access to optimal auditory stimulation, 
but also of lower performance expectations that may 
have existed when these children first underwent im- 
plantation. In contrast, the infant group may have 
benefited both from greater sound access and from 
higher expectations from the outset. 


Third, there have been substantial advancements 
in implant technology and speech coding strategies. 
The first groups of children to receive cochlear im- 
plants generally had access to a device with | pro- 
gram memory and relatively limited speech coding 
strategies. Children who underwent implantation 
more recently have had access to a greater variety of 
speech coding strategies and greater programming 
flexibility. These improvements in technology posi- 
tively contribute to higher overall performance among 
the more recently, hence often younger, implanted 
children. 


Finally, Yoshinaga-Itano and Apuzzo!* and Downs 
and Yoshinago-Itano!> have documented that when 
children receive a diagnosis of hearing loss (ranging 
from mild to profound) and begin intervention within 
the first 6 months of life, they are more likely to 
achieve age-appropriate language skills than are chil- 
dren who receive a diagnosis and are treated after 6 
months of age. Nine children reported on in this ar- 
ticle were identified with hearing loss and began in- 
tervention before 6 months of age. Five of these chil- 
dren underwent implantation between 9 and 18 
months of age, | underwent implantation between 19 
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and 30 months, and 3 underwent implantation be- 
tween 31 and 40 months. The better performance dem- 
onstrated by the children in the youngest age group 
may reflect the language benefits of early identifica- 
tion. Although this may be the case, reviewing the 
language performance of all 9 early identified chil- 
dren suggested that the younger implant recipients 
outperformed the early-identified, later-implanted 
children (regardless of whether signed or spoken lan- 
guage abilities were considered). 


The data presented here document several in- 
stances in which children identified with severe-to- 
profound hearing loss who underwent implantation 
at 18 months of age or younger have achieved nor- 





mal or near-normal spoken language skills. Although 
the group sizes were small, there was a clear pattern 
of performance, with the highest outcomes among 
the children who underwent implantation at 18 months 
of age or younger and the poorest outcomes in the 
children who underwent implantation between 41 and 
48 months. As such, the data give credence to the 
existence of critical periods of neural development 
and evidence the ability of young implanted children 
to take advantage of such sensitive periods of devel- 
opment. These data have significant implications for 
early identification and intervention programs, the 
age of implantation, and counseling of families re- 
garding realistic expectations both before and after 
implantation. 
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POSTIMPLANTATION AUDITION AND EDUCATIONAL ATTAINMENT 
IN CHILDREN WITH PRELINGUALLY ACQUIRED PROFOUND 
DEAFNESS 


ARTHUR BOOTHROYD, PHD 
SAN DIEGO, CALIFORNIA 


DOROTHY BOOTHROYD-TURNER, MED 
TORONTO, CANADA 


Language and educational attainment were evaluated over a 4-year period in 8 profoundly deaf, orally trained child users of the 
Nucleus multichannel implant with the spectral peak (SPEAK) processor. The average age at implantation was 5.8 years. In terms of 
audition and language, these children resembled severely deaf children, with unaided hearing losses in the region of 70 to 75 dB, who 
use hearing aids. In tests of educational attainment and literacy, average performance lagged behind grade placement by an amount 
that increased with increasing language demand of the task. The benefits of implantation are considerable; nevertheless, auditory 
limitations, combined with language deficits already present at the time of implantation, present a continuing educational challenge. 
Earlier implantation, when the auditory system is more plastic and language delays are automatically minimized, may well produce 


better outcomes, but this advantage remains to be demonstrated. 


KEY WORDS — audition, cochlear implant, congenital deafness, educational attainment, hearing aid, language, literacy, pedi- 
atric deafness, prelingual deafness, profound deafness, speech perception. 


INTRODUCTION 


Congenital hearing loss, by its impact on spoken 
language acquisition, has far-reaching implications 
for intellectual and social development, literacy, edu- 
cational attainment, employment, and quality of life. 
The severity of the consequences depends on many 
factors. Some are intrinsic to the child, some a func- 
tion of the family, some a function of society at large, 
and some a function of audiological, clinical, and 
educational management.! Among these factors, the 
child’s assisted auditory capacity plays a major role, 
assuming, of course, that spoken language acquisi- 
tion is included in the goals of intervention. 


It is clear that auditory capacity with an implant is 
significantly better than aided auditory capacity in a 
high proportion of profoundly deaf children. It is rea- 
sonable to expect, therefore, that the provision of a 
multichannel cochlear implant to a previously aided, 
profoundly deaf child, combined with appropriate 
management after implantation, should also improve 
speech, language, literacy, and educational attainment 
(Fig 1). 

This prediction was tested in the study to be re- 
ported. A group of children with prelingually acquired 
profound deafness received cochlear implants while 
they were enrolled in a special mainstream-support 
program within the Metropolitan Toronto school sys- 
tem. All had previously worn hearing aids and had 
received auditory and spoken-language training. The 


present study began a few years after implantation. 
Comprehensive testing was followed by long-term 
follow-up in order to answer the following questions. 


1. How well do these children hear? 


2. How does their hearing compare with that of 
severely and profoundly deaf children with hearing 
aids? 

3. How does their hearing compare with that of 
other children with implants? 


4. What is the long-term impact of the improved 
hearing on language, literacy, and educational attain- 
ment? 





Better literacy and attainment | 


Faster and better language acquisition 


| Better speech perception and production | 























Increased auditory capacity 








[ Profound hearing loss | 





Fig 1. Immediate and long-term goals of cochlear implan- 
tation in children. 
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Fig 2. Background data for 8 children with implants at 
beginning of present study. 


PATIENTS AND METHODS 


Children. The initial group consisted of 5 girls and 
3 boys. All 8 children were believed to have been 
deaf since birth. The cause of deafness was genetic 
for 2 and uncertain for 6. The age at identification 
averaged 1.4 years. The better-ear 3-frequency aver- 
age hearing loss was in excess of 107 dB. After iden- 
tification, all children had been fitted with hearing 
aids and received auditory-oral training and/or audi- 
tory-verbal therapy. The duration of hearing aid use 
before implantation averaged 4.3 years. The age at 
implantation averaged 5.8 years. At the beginning 
of this study, the duration of implant use averaged 
4.9 years, and the current age averaged 9.5 years. As 
a group, these children were of average intelligence, 
as measured by the performance (nonverbal) sub- 
score of the Wechsler Intelligence Scale for Children 
(WISC-III). The average IQ was 102. One of the chil- 
dren, however, obtained a low score on this test: an 
IQ of 72 suggesting difficulties at the nonverbal lev- 
el. The IQs of the other 7 children ranged from 96 to 
126. Individual and average background data are il- 
lustrated in Fig 2. 


Education. As stated above, all children had re- 
ceived auditory-oral or auditory-verbal training af- 
ter initial diagnosis and hearing aid fitting. At the 
beginning of this study, they were enrolled in a spe- 
cial program developed by the Metropolitan Toronto 
School Board. Instruction and communication were 
by spoken language. Mornings were spent in a spe- 
cial class in which the children received language 
instruction, combined with preparatory and support- 
ive teaching. Afternoons were spent in a mainstream 
setting, typically 1 grade level below their chrono- 
logical age. In addition, some of the children received 
after-school training from an auditory-verbal thera- 
pist. During the 4 years of this study, several of the 
children graduated to full mainstreaming. 


œ= Audio-visual (best production) 


== Audio-only (auditory perception) 
100 ; - 
| | 
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Fig 3. Scores on Imitative Test of Speech Pattern Con- 
trast Perception (IMSPAC) obtained by 8 children with 
implants, via their implants. at beginning of present study. 


Sensory Assistance. All children used the Nucleus 
multichannel implant with the SPEAK processor. 
They also used a Phonic Ear FM system (471/475) 
linked to their processors. A hearing-impaired teacher 
who used the same implant carried out weekly checks 
of this system.* One of the children wore a hearing 
aid in the nonimplanted ear. 


Study Method. This was a 4-year study. Tests of 
auditory speech perception, intelligence, and elicited 
language were administered in the first year. Aca- 
demic attainment was then tracked by use of stan- 
dardized tests given at |-year intervals. The test pro- 
cedures and results are presented below in relation 
to the research questions outlined in the Introduc- 
tion. 


RESULTS 


Question 1: How Well Do These Children Hear? 
Figure 3 shows individual and group scores on the 
Imitative Test of Speech Pattern Contrast Perception 
(IMSPAC).? This test measures the subject’s abil- 
ity to convey phonetic contrasts to normally hearing 
listeners by the imitation of syllables that are pre- 
sented live. For audiovisual testing, the models are 
presented by hearing and lipreading and supple- 
mented by text. The results provide an estimate of 
best production skills. For audio-only testing, the 
models are presented by hearing alone, and the re- 
sults provide an estimate of auditory speech percep- 
tion capacity up to any limit imposed by production. 
The group mean scores of the children in this study, 
averaged across several contrasts, were around 80% 
for audiovisual presentation and around 70% for au- 
dio-only presentation. 


Figure 4 shows the group’s average audio-only 
score for the IMSPAC together with scores for the 
recognition of phonemes in consonant-vowel-con- 
sonant (CVC) words, whole CVC words in isola- 
tion, words in difficult (grade 5) sentences, and words 


# 
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Fig 4. Group mean performance on several tests of speech 
perception for 8 children with implants at beginning of 
present study. Error bars show standard error of mean. 
CVC — consonant-vowel-consonant. 


Words in 





in easy (preschool) sentences. Phoneme recognition 
and isolated word recognition were measured from 
repetition of the Arthur Boothroyd short isophonemic 
word lists.46 Word recognition in sentences of known 
topic was measured from repetition of materials de- 
veloped at the City University of New York for this 
and related projects. As a group, these children were 
able to recognize, via their implants, 82% of the vow- 
els and 58% of the consonants in CVC words, 34% 
of the whole CVC words in isolation, 62% of the 
words in difficult sentences, and 85% of the words 
in easy sentences. These data confirm that the pho- 
netic contrast information provided by the implants 
was being used for the auditory perception of pho- 
nemes and words. In addition, the strong effects of 
sentence context and sentence difficulty suggest ex- 
cellent use of world and language knowledge in the 
speech perception process.’.8 
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Fig 5. Median and interquartile range of audio-only 
IMSPAC scores for 8 children with implants in present 
study, compared with similar data for 5 groups of aided 
children with various amounts of unaided hearing loss. 
As a group, the children with implants show performance 
similar to that of aided children with severe loss in re- 
gion of 70 to 75 dB. 
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Fig 6. Distribution of composite audio-only IMSPAC 
scores of 8 children involved in present study (bottom 
panel) as compared with those of 2 groups of child users 
of Nucleus implant with earlier version of speech pro- 
cessor. All children in comparison groups had at least 2 
years’ experience with their implants. TC — total com- 
munication; MSP — mini speech processor; SPEAK — 
spectral peak. 


Question 2: How Does Their Hearing Compare 
With That of Severely and Profoundly Deaf Children 
With Hearing Aids? On the left of Fig 5 are the me- 
dians and interquartile ranges of audio-only IMSPAC 
scores obtained by groups of child hearing aid users 
with various degrees of hearing loss. These data were 
obtained in an earlier study.>> The line is a least- 
squares linear fit to the medians. On the right of Fig 
5 are the median and interquartile range of the au- 
dio-only IMSPAC scores obtained by the 8 children 
with implants involved in the present study. It will 
be seen that their performance approximates that of 
severely deaf hearing aid users who have unaided 
hearing losses in the region of 70 to 75 dB.? Similar 
analysis of phoneme recognition supports the con- 
clusion that the auditory speech perception capacity 
of these children was similar to that of aided severely 
deaf children.® This level of performance is dramati- 
cally better than that predicted for hearing aid users 
with a loss of 107 dB, which is the best estimate of 
the outcome that would have been attained had they 
continued to use hearing aids. Nevertheless, their per- 
formance is well below that expected from children 
with normal hearing.!° 


Question 3: How Does Their Hearing Compare 
With That of Other Children With Implants? In Fig 
6, the distribution of audio-only IMSPAC scores in 
this group of children can be compared with those of 
2 groups of children with implants involved in an 
earlier study. The children in the comparison groups 
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Fig 7. Verbal and nonverbal IQs of 8 children with im- 
plants in present study, as measured by Wechsler Intelli- 
gence Scale for Children (WISC-III). DNT — did not 
test. 
all used the Nucleus implant, but with the earlier mini 
speech processor. All had at least 2 years’ experience 
by the time these data were collected. The data in the 
top panel were obtained from children in oral pro- 
grams. The data in the middle panel were obtained 
from children in total communication programs. It will 
be seen from Fig 6 that the distribution of implanted 
auditory capacity among the 8 children in the present 
study was similar to that of the more successful orally 
trained users of the MSP processor in the earlier study. 


Question 4: What Is the Long-Term Impact of the 
Improved Hearing on Language, Literacy, and Edu- 
cational Attainment? Two tests provided information 
on linguistic development at the beginning of the 
study. One was the verbal subscore of the WISC-III. 
One of the children was withdrawn from the study 
before this test was administered. The average ver- 
bal IQ of the other 7 was 64, which was well below 
their average nonverbal IQ of 103. These data, illus- 
trated in Fig 7, show that the language skills of these 
children were well below those of their hearing peers 
at the start of this study, despite an average of 5 years’ 
experience with the implant. 


The second test used to measure linguistic devel- 
opment was the Grammatical Analysis of Elicited 
Language (GAEL-C) of Moog and Geers.!! This test 
has been normed on children with normal hearing, 
severe hearing loss, and profound hearing loss. Fig- 
ure 8 shows the performance of the children in this 
study expressed in terms of percentiles for the 3 nor- 
mative groups. It will be seen that the average per- 
formance was at the 71st percentile when compared 
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Fig 8. Scores of 7 children with implants on Imitated 
and Prompted subtests of Grammatical Analysis of Elic- 
ited Language obtained at beginning of present study. 
Scores are expressed in terms of percentile ranking with 
reference to normative groups of profoundly deaf, se- 
verely deaf, and normally hearing children. 


with profoundly deaf children, at the 50th percentile 
when compared with severely deaf children, and at 
the 2nd percentile when compared with normally 
hearing children. These findings are in keeping with 
the data on auditory capacity. 


At the beginning of this study, and annually for 
the next 4 years, the Canadian Test of Basic Skills 
(CTBS) was administered to 7 of the children (1 hav- 
ing withdrawn during the first year). The CTBS is a 
general test of educational attainment, designed for 
norma'ly developing children. It is administered in 
written form with multiple-choice responses. Figure 
9 shows performance in grade level for 5 subtests as 
a function of actual grade placement. The heavy black 
line is a least-squares polynomial fit to the data (first- 
order for the first 3 panels and second-order for the 
last 2). The diagonal indicates performance appro- 
priate to grade placement. 


As a group, these children maintained performance 
at grace level on mathematical computation. Their 
performance on mathematical concepts and problem 
solving was somewhat below grade level, but the 
grade lag did not increase over time. On the reading 
and vocabulary tests, however, the grade lag was 
marked and increased over time. It is clear from these 
data that the disparity between performance and place- 
ment imcreased with increasing language demand of 
the test. This performance disparity between nonver- 
bal and verbal tasks parallels that described earlier 
for the WISC-III scores. Taken as a whole, these data 
sugges: that the enhanced auditory capacity provided 
by the implant had a positive effect on language de- 
velopment. It is clear, however, that serious deficits 
of language and literacy remained. 
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Grade 


Fig 9. Performance of 7 children with implants on 
5 subtests of Canadian Test of Basic Skills over 4- 
year period. Scores are shown in terms of grade- 
level attainment as a function of actual grade place- 
ment. Note that the disparity between performance 
and placement increases with increasing language 
load of task. 


Attainment 


DISCUSSION AND CONCLUSIONS 


The positive benefits of implantation for these chil- 
dren are not in doubt. From an auditory perspective, 
they were transformed from profoundly or totally deaf 
children into the equivalent of severely deaf children 
who wear hearing aids. This transformation was re- 
flected in terms of motor speech skills and, accord- 
ing to the GAEL results, language. During structured 
interviews, the parents also reported many benefits, 
including increased self-esteem, reduced isolation, 
closer relationships with siblings and peers, improved 
language, more intelligible speech, and ability to use 
the telephone. A universal reaction of the parents was 
that these observable benefits of the implant have 
resulted in brighter hope for the future. 


Nevertheless, it is also clear that the implant re- 
sulted neither in normal hearing nor in age-appropri- 
ate language and literacy. This group of children con- 
tinued to need educational and remedial support suit- 
able for children with severe hearing loss. Their ease 
of communication in high-redundancy conversation 
could be misleading. Recall that the average audi- 
tory-only word recognition score was 85% for easy 
sentences. But, when the children were presented with 
more difficult sentences of the type encountered in 
the classroom, this score dropped to 62%. Even when 
supplemented by speech-reading, this level of per- 
formance represents a serious impediment to learn- 
ing in a setting that is geared to the needs of normal- 
ly hearing children. It is, therefore, essential that cur- 
ricula, teaching strategies, and support services for 
children with implants address both auditory defi- 
cits and language delays. 


The grade lags in literacy are of serious concern. 


Such a finding, however, should not be surprising. 
The acquisition of new knowledge is based on exist- 
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ing knowledge. Moreover, for optimal learning, the 
proportion of new material to familiar material should 
be kept fairly low (in the region of 15%). A deficit in 
existing knowledge automatically increases the pro- 
portion of new material, thereby reducing learning 
effectiveness and leading to further deficits. This pro- 
cess is apparent in this group of children, even with 
intensive educational support. 


These children, on average, spent the first 1.4 years 
of their lives without hearing and the next 4.3 years 
with very limited aided hearing. By the time they 
received the auditory benefits of a multichannel co- 
chlear implant, the average age was almost 6 years. 
When entering school, therefore, they faced a double 
challenge. They had to deal with the auditory limita- 
tions of hearing with an implant, and they had to cope 
with the language delays that had accumulated by the 
time of implantation. 


Such considerations lead to the obvious prediction 
that earlier implantation, at an age when the audi- 
tory system is more plastic and any language delay 
is automatically small, would have led to a more nor- 
mal growth of language and literacy in these sub- 
jects. Also in support of this prediction is evidence 
that early auditory deprivation can lead to realloca- 
tion of perceptual resources in the auditory cortex.!2 
In fact, the 1 student in this sample whose language 
and literacy performance was at, or close to, grade 
placement also underwent implantation the earliest. 
It is important, however, that this issue be explored 
objectively over the long term and over larger stu- 
dent groups. In the meantime, the dangers of unreal- 
istic expectations and unquestioned assumptions are 
clear. The limitations of current cochlear implants 
must be acknowledged and educational support pro- 
vided according to individual needs. 
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In summary, children with prelingually acquired 
profound deafness who are fitted with a multichan- 
nel cochlear implant during the early school years can 
acquire dramatically enhanced auditory capabilities 
resulting in improved prognosis for spoken language 
skills, literacy, and educational attainment. Hearing 
with an implant, however, does have limitations, 
which are roughly equivalent to those of a severely 
deaf child fitted with hearing aids. The auditory limi- 





tations, combined with any language delays already 
present at the time of implantation, create a continu- 
ing need for educational management appropriate to 
hearing-impaired children, with particular emphasis 
on the development of language and literacy. It is 
reasonable to predict that early implantation would 
result in more age-appropriate language and literacy, 
but careful research will be needed to test this pre- 
diction. 
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DEVELOPMENT OF SPEECH PERCEPTION AND PRODUCTION IN 
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The purpose of the present study was twofold: 1) to compare the hierarchy of perceived and produced significant speech pattern 
contrasts in children with cochlear implants, and 2) to compare this hierarchy to developmental data of children with normal hearing. 
The subjects included 35 prelingual hearing-impaired children with multichannel cochlear implants. The test materials were the 
Hebrew Speech Pattern Contrast (HeSPAC) test and the Hebrew Picture Speech Pattern Contrast (HePiSPAC) test for older and 
younger children, respectively. The results show that 1) auditory speech perception performance of children with cochlear implants 
reaches an asymptote at 76% (after correction for guessing) between 4 and 6 years of implant use; 2) all implant users perceived 
vowel place extremely well immediately after implantation; 3) most implanted children perceived initial voicing at chance level until 
2 to 3 years after implantation, after which scores improved by 60% to 70% with implant use; 4) the hierarchy of phonetic-feature 
production paralleled that of perception: vowels first, voicing last, and manner and place of articulation in between; and 5) the 
hierarchy in speech pattern contrast perception and production was similar between the implanted and the normal-hearing children, 
with the exception of the vowels (possibly because of the interaction between the specific information provided by the implant 
device and the acoustics of the Hebrew language). The data reported here contribute to our current knowledge about the development 
of phonological contrasts in children who were deprived of sound in the first few years of their lives and then developed phonetic 
representations via cochlear implants. The data also provide additional insight into the interrelated skills of speech perception and 
production. 


KEY WORDS — cochlear implant, developmental speech perception, developmental speech production, hearing-impaired child, 


phonetic contrasts, phonological contrasts. 


INTRODUCTION 


Many studies have demonstrated the effectiveness 
of cochlear implant devices for enhancing speech per- 
ception and production and language skills in chil- 
dren with prelingual severe to profound hearing im- 
pairment.!-6 Advancements in speech coding strate- 
gies and electrode arrays, as well as the growing trend 
to perform implantation in children at a very young 
age, have all encouraged researchers to continue in- 
vestigating speech perception and production perfor- 
mance in children with implants. One such area of 
investigation relates to the development of speech 
in children whose only source of auditory informa- 
tion is from the implant device and to compare their 
development to that of normal-hearing cohorts. The 
purpose of the present study was, therefore, twofold: 
1) to examine the development of phonological con- 
trasts by both auditory perception and production in 
young children with implants as a function of im- 
plant use, and 2) to compare these data with develop- 
mental data from children with normal hearing (Ki- 
shon-Rabin et al, unpublished observations). The ra- 
tionale for choosing phonological contrasts was that 
they are minimally influenced by the listeners’ lin- 





guistic knowledge, can be explained in acoustic 
terms,’ and are known to predict speech performance 
of higher linguistic redundancy.® 


METHODS 


Participants. Thirty-five children between 2.5 and 
10 years of age participated in the perception tasks 
of this study. All children had profound hearing loss 
(pure tone average greater than 110 dB hearing level) 
before implantation. The cause of deafness was un- 
known in 23 cases. The remaining causes were hered- 
ity (7 cases), Waardenburg’s syndrome (2 cases), 
Mondini dysplasia (1 case), rubella (1 case), and cy- 
tomegalovirus (1 case). Before implantation, more 
than 50% of the children used total communication, 
11 were in auditory-oral programs, and 3 used an 
auditory-verbal approach. Most of these children 
changed to either auditory-oral or auditory-verbal ap- 
proaches immediately after implantation. Their age 
at implantation ranged between 2.5 and 10 years (av- 
erage, 4.6 years), and they had used their implant 
device between 0.3 and 6.5 years (average, 1.6 years). 
The majority of the children underwent implantation 
with the multichannel cochlear implant manufactured 
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Fig 1. Mean perception (filled circles) and production 
(open diamonds) of speech contrasts (after correction for 
guessing) of each implanted child as function of years of 
implant use. Four implanted children who perceived ex- 
ceptionally well are indicated by circled cross. Solid lines 
are best-fitting exponential learning curves. Confidence 
limit at 0.01 is 20% after correction for guessing. 
by Cochlear Ltd and used the spectral peak (SPEAK) 
stimulation strategy. Five children underwent implan- 
tation with the Clarion Multi-Strategy cochlear im- 
plant device. Seventeen of these children were se- 
lected for speech production evaluation. They were 
selected on the basis of availability and after screen- 
ing for possible articulation problems not related to 
their hearing loss. The average age of this group was 
6.5 years (range, 3 to 10 years). Their average age at 
implantation was 4.1 years (range, 2.5 to 8 years). 
All were using the SPEAK stimulation strategy. 


Speech Perception Measures. The Hebrew Speech 
Pattern Contrast (HeSPAC)? test adapted from En- 
glish! was used to assess the auditory speech per- 
ception skills of the children with implants who ac- 
quired reading skills. For the younger children, a pic- 
ture version of the HeSPAC was used, also known 
as the Hebrew Picture Speech Pattern Contrast (HePi- 
SPAC)!! test. Both tests use a binary forced-choice 
paradigm to estimate the probability of perception of 
phonologically significant contrasts. All stimuli were 
consonant-vowel-consonant (CVC) meaningful 
words. Stimuli of the HePiSPAC test met 2 additional 
requirements: |) they can be presented in a picture 
form, and 2) they are included in the inventory of 
words frequently produced by preschool Hebrew- 
speaking children. The auditory perception of 8 seg- 
mental contrasts was assessed: vowel height, vowel 
place, and place of articulation, manner, and voicing 
in initial and final word positions. For each child, 
the score of a given significant contrast was based 
on 18 to 24 trials. Before testing, the child was fa- 
miliarized with the test items and their correspond- 
ing pictures (when relevant). The tester verified the 
knowledge of the words by either requesting the child 


to read, name the picture, or point to the target word 
presented in audiovisual modes supplemented at times 
with marual.communication. This procedure was also 
to confirm that the task was understood and was not 
limiting performance (especially of the youngest sub- 
jects). The subjects were tested individually in a 
sound-treated room. They were instructed to point 
to the picture corresponding to the word they heard. 
Contrasts were presented randomly across subjects. 
Verbal feedback and tangible rewards were provided. 


Speec’ Production Measures. Thirty 1-syllable 
CVC werds were chosen from the HePiSPAC test. 
Each child was asked to name the picture. If, how- 
ever, a response could not be elicited in this manner, 
a model was presented in an effort to obtain an imi- 
tative response. Speech production was measured 
over tim2 at l-year intervals from 1 to 4 years after 
implantation. Note, however, that not all of the sub- 
jects were available at all times. The production of 
each chi d at a given recording time was presented 
to 3 normal-hearing listeners who were not familiar 
with the speech of the hearing-impaired. In order to 
avoid learning of the test stimuli by the judges, a 
total of 84 listeners participated throughout this study. 
The listeners were requested to write exactly what 
they heard even if the response was incomplete or 
not meaningful. Consonant and vowel accuracy was 
then examined according to the significant speech 
features. These values were corrected for guessing, 
following the reasoning introduced by Boothroyd et 
al!? and adapted to the Hebrew language. We evalu- 
ated speech production in open-set format on the ba- 
sis of a study that compared both closed-set and open- 
set methods. !? The results of this study demonstrated 
unequivocally that interjudge variability was signifi- 
cantly smaller in the open set and not as previously 
reported !4 Furthermore, the validity of open-set test- 
ing is considered greater. !5 


RESULTS 


The average perception score of the phonological 
contrasts for each child (after correction for guess- 
ing) as a function of years of implant use is shown in 
Fig 1. With the exclusion of 4 children (circled), au- 
ditory speech perception performance appears to fol- 
low an exponential function (r [32] = 0.65; p < .01) 
that reacres its 90% growth at approximately 4 years 
of implaat use. The data also show that after 6 years 
of implant use, the mean HeSPAC test scores reach 
approxinately 76% (after correction for guessing). 
Fairly large variability in performance is observed, 
suggesting significant influence of additional factors 
other then implant use on the results. There are too 
few children to separate the effects of confounding 
factors such as age at implantation, mode of com- 














Kishon-Rabin et al, Speech Perception & Production in Children With Cochlear Implants 87 


3 8 


Mean Contrasts 


(% Correct re: guess) 








22 25 -30 35 -40 45 50 55 66 65 70` 


Chronological Age 
(Months) 

Fig 2. Mean perception (filled circles) and production 
(open diamonds) of speech contrasts (after correction for 
guessing) of each child with normal hearing as function 
of chronological age. Solid lines are best-fitting expo- 
nential learning curves. Confidence limit at 0.01 is 20% 
„after correction for guessing, 


munication before implantation, years of implanta- 
tion, chronological age, and hearing status before im- 
plantation. Nonetheless, we-were able to character- 
ize the 4 children who demonstrated exceptionally 
high performance between 0.5,'and’2:years after im- 
plantation as all having usable hearing before im- 
plantation and as all being trained with the auditory- 
verbal approach. Examination of individual contrasts 
showed that vowel place was found to be perceived 
by all implant users, scoring above 90% correct (af- 
ter correction for guessing) regardless of years of im- 
plant use. Initial voicing, on the other hand, was per- 
ceived at guessing level for most children (but not 
all) until 2 to 3 years of implant use. Continuous use 
of the device increased performance to 60% to 70% 
(after correction for guessing) at 5 to 6 years of im- 
plant use. The range of results for the remaining con- 
trasts — vowel height, manner, and place of articu- 
lation — fell (in that order) between the best- and 
worst-perceived contrasts. 


Figure 1 also illustratés the production accuracy 
of the mean phonological contrasts for each child 
(after correction for guessing) as a function of years 
of implant use, as well as the best-fitting function to 
the data. There is an increase in performance in the 
first couple of years of implant use, after which pro- 
duction accuracy appears to level off at approxi- 
mately 80%. Inspection of the individual contrasts 
reveals that accuracy of production of all contrasts 
improved with implant.use (p < .0001). Similar to 
perception, vowel place and voicing were the best- 
and worst-produced contrasts, respectively, when 
tested first 1 year after implantation (average of 70% 
after correction for guessing and chance level, respec- 
tively). The remainder of the contrasts were produced 


at an accuracy of approximately 40% to 50% after 
correction for guessing after the first year of implant 
use. By 4 years of implant use, the accuracy of per- 
formance improved the most for voicing, to 61% (af- 
ter correction for guessing). Improvements of 30% 
to 45% correct were also demonstrated for the re- 
mainder of the contrasts. 


DISCUSSION 


The results of the present study reveal substantial 
improvements in the mean perception of phonologi- 
cal contrasts as a function of implant use. Percep- 
tion performance reached 70% to 80% correct (after 
correction for guessing) after approximately 4 years 
of implant use and is similar to the performance 
shown by Boothroyd and Boothroyd-Tumer (this 
supplement pp 79-84) for their best implant users on 
the Imitative Test of Speech Pattern Contrast Per- 
ception (IMSPAC), but higher than that previously 
published for children with implants.!.! Interestingly, 
developmental data of children with normal hearing 
show that they reach 90% of performance at approxi- 
mately 4 years of age, as well (Kishon-Rabin et al, 
unpublished observations). Our accuracy-of-produc- 
tion results are also in keeping with recently pub- 
lished data-!7 and are better than those reported by 
others.7-!8-21 The substantial improvements reported 
in the present study may be attributed in part to the 
advanced speech processing used currently by the 
children with implants. There is evidence to suggest 
that the SPEAK, continuous interleaved sampler 
(CIS), and simultaneous analog stimulation (SAS) 
processing strategies provide better spectral and tem- 
poral representation of the signal than that provided 
by the older strategies, which were primarily based 
on feature extraction approaches, such:as-mini speech 
processor (MSP) and multipeak (MPEAK).?2-*4 The 
finding that there are several children who reach per- 
ception performance of 90% or more with relatively 
little auditory experience with the device supports 
the notion that the implant device transmits the speech 
signal effectively. Why only few children were able 
to do so with little auditory experience may be at- 
tributed to the fact that these children were trained 
with the auditory-verbal method and had usable hear- 
ing before implantation. The data are. few, but sup- 
port the importance of early auditory experience and 
intensive aural rehabilitation in children with i im- 
plants. ... : 


Ghnaiion between the perception and: pode: 
tion of phonological contrasts in children with im- 
plants suggests that for this particular group of hear- 
ing-impaired children, production skills develop be- 
fore perception (Fig 1). Data on young.children with 
normal hearing, on the other hand, show an opposite 
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trend, as illustrated in Fig 2. The general course of 
normal development is such that auditory speech per- 
ception precedes speech production. We thus suspect 
that the age of the children with implants in the pres- 
ent study contributed to the finding that production 
preceded perception. Because they were deaf for the 
first 2 years of their lives (or more), speech produc- 
tion developed primarily on the basis of visual cues. 
We assume that as the age of implantation continues 
to decrease and young infants will undergo implan- 
tation, their perception-production development 
curves will resemble those of children with normal 
hearing. 


Of the individual contrasts, the contrast that was 
best perceived and best produced by the implanted 
children was vowel place. This contrast was per- 
ceived extremely well by a// children within the first 
year of implant use. Young children with normal hear- 
ing and aided hearing-impaired children, on the other 
hand, best perceive vowel height!! (also Kishon-Ra- 
bin et al, unpublished observations). These differ- 
ences between the children with implants and the oth- 
er groups of children are attributed to the reduced 
number of vowels in Hebrew (a total of 5) and thus 
to the reduced vowel space.?> One of the implica- 
tions of this reduced vowel space is the smaller acous- 
tic difference that exists between the vowels (first 
formant [F1] values in particular) as compared to dif- 
ferences measured in other languages. Acoustic analy- 
ses of our recordings revealed that F1 values of vow- 
els that differ by tongue height fell within the fre- 
quency range of the same electrode or adjacent elec- 
trodes (according to the electrode frequency alloca- 
tion tables), thus explaining the poor vowel height 
perception of the implant users. In contrast, several 
electrodes separate between second formant (F2) val- 
ues that mediate differences in vowel place. Sensa- 
tions from electrodes that are significantly apart from 
each other appear to be immediately noticeable even 
with little implant experience and do not predict 
speech perception performance. This finding has im- 
portant clinical implications. Vowel place perception 
may serve as a general indicator for MAP fitting. In 
other words, if vowel place is perceived at a below- 
chance level, the fitting parameters of the implant 
device may need to be reexamined. 


That children with implants produced the vowel 
place contrast with the most accuracy is similar to 
another finding in children with severe to profound 
hearing loss.!3 These data are in keeping with what 
is known regarding the visibility of this contrast,!° 
as well as with the assumption that speech produc- 
tion is highly dependent on visual information. The 
data reported here also coincide with the finding that 
the production of vowel place is first to develop in 


normal-hearing children (Kishon-Rabin et al, unpub- 
lished observations). These results are not surpris- 
ing, considering that in Hebrew there are only 2 sets 
of vowels that differ in tongue position: /i/ versus /u/ 
and /e/ versus /o/. For both sets of contrasting stimuli, 
lip configuration differences are very apparent (spread 
versus round) and probably serve as the major visi- 
ble cue to the contrast. 


The contrast that was last to develop by percep- 
tion and production in our children with implants was 
voicing. These data are in keeping with those reported 
by Tye-Murray et al*°: after 2 years of implant use, 
children showed the poorest performance in the per- 
ception and production of this contrast. Interestingly, 
voicing was also the last contrast to be perceived and 
produced by young normal-hearing children (Kishon- 
Rabin et al, unpublished observations). The difficulty 
in perceiving voicing in initial position may be ex- 
plained in part by the inability to speech-read this 
contrast. Studies have shown that information about 
the timing of voice onset in relation to the articula- 
tory movement is not available when presented vi- 
sually in either English?’28 or Hebrew.’ Similarly, 
Hebrew voicing in final position cannot be perceived 
visually? or when supplemented with sensory aids.?9 
This can be explained by the small difference be- 
tween the length of the vowel that precedes final 
voiced Hebrew plosives and the length of the vowel 
that precedes final unvoiced Hebrew plosives’; the 
difference is greater in English. The inability to per- 
ceive Hebrew voicing visually implies that any de- 
velopment or formation of phonetic prototypes of this 
contrast is highly dependent on auditory information. 
If such information does not exist, as in prelingually 
profoundly hearing-impaired children, the contrast 
would not develop. This idea is confirmed by the 
chance-level scores of voicing (in both perception 
and production) in children whose experience with 
the implant device was short. Once hearing is par- 
tially restored via cochlear implants, voicing informa- 
tion is transmitted effectively via audition and in con- 
junction with the visible articulatory movements. 
This idea is further confirmed by the dramatic im- 
provements in the perception and production of this 
contrast after implantation. The finding that young 
children with normal hearing also had difficulty in 
perceiving and producing voicing suggests that nor- 
mal speech development is guided by the ease of ac- 
cessibility and perceptual salience of speech cues. 
Thus, relative timing of events arriving from differ- 
ent modes, or alternatively from one mode only (with 
no visual support), appears to be a difficult and de- 
manding task for young children with normal hear- 
ing. 

Finally, the data reported here contribute to our 
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current knowledge about the development of phono- 
logical contrasts in children whose auditory cortex 
and language-related areas were stimulated for the 
first time after years of deafness with speech stimuli 
via a cochlear implant. These children show develop- 
ment similar to that of children with normal hearing. 
This similarity suggests that the development of 
speech depends on the accessibility of acoustic cues 
and appears to be the same for children with normal 
hearing and those with implants and thus is not af- 
fected by years of auditory deprivation. The one ex- 





ception is vowel place, which children with implants 
perceived very well immediately after implantation. 
This finding is probably the result of some interac- 
tion between the processing strategy and the acous- 
tics of Hebrew. The finding that voicing performance 
increases dramatically with use of the implant de- 
vice confirms the accessibility of the acoustic signal 
via electrical stimulation. The fact that voicing per- 
ception enhanced voicing production provides fur- 
ther insight into the relationship between these two 
skills. 
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IMITATION OF NONWORDS BY DEAF CHILDREN AFTER COCHLEAR 
IMPLANTATION: PRELIMINARY FINDINGS 
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Fourteen prelingually deafened pediatric users of the Nucleus-22 cochlear implant were asked to imitate auditorily presented 
nonwords. The children’s utterances were recorded, digitized, and broadly transcribed. The target patterns and the children’s imita- 
tions were then played back to normal-hearing adult listeners in order to obtain perceptual judgments of repetition accuracy. The 
results revealed wide variability in the children’s ability to repeat the novel sound sequences. Individual differences in the compo- 
nent processes of encoding, memory, and speech production were strongly reflected in the nonword repetition scores. Duration of 
deafness before implantation also appeared to be a factor associated with imitation performance. Linguistic analyses of the initial 
consonants in the nonwords revealed that coronal stops were imitated best, followed by the coronal fricative /s/, and then the labial 
and velar stops. Labial fricatives were poorly imitated. The theoretical significance of the nonword repetition task as it has been used 
in past studies of working memory and vocabulary development in normal-hearing children is discussed. 


KEY WORDS — child, cochlear implant, language, nonword, nonword repetition, phonology, speech, word recognition. 


INTRODUCTION 


A primary goal of our current research program is 
to account more fully for the large individual differ- 
ences in spoken language development that are typi- 
cally observed after cochlear implantation in prelin- 
gually deafened children.! As part of this effort, we 
have recently begun using a nonword repetition task 
to study individual variation in how effectively chil- 
dren with cochlear implants (CIs) can use sublexical 
phonological knowledge about the sound patterns in 
their ambient language to decode spoken nonsense 
words. a 


In a typical nonword repetition task, the child is 
asked to listen to a novel nonsense word and to re- 
peat it back aloud.?.3 The child is warned in advance 
that the stimuli will be unfamiliar and is told to ap- 
proximate the items to the best of his or her ability. 
Despite its surface simplicity, nonword repetition is 
a complex information processing task that requires 
a child to perform reasonably well in each of its com- 
ponent processes. These component processes in- 
clude auditory and phonological encoding, short-term 
storage of the target item in working memory, and 
speech production. 


Our use of a nonword repetition task with CI us- 
ers was motivated in part by an ongoing debate among 
speech perception researchers regarding the degiee 


to which knowledge of word meaning plays a role in 
the identification of the individual constituent pho- 
nemes that constitute words.* This debate has car- 
ried over to discussion of how a child with a CI is 
able to perform a typical open-set speech perception 
task in which he or she is asked to repeat back spo- 
ken words. It has been suggested that a hearing-im- 
paired child will be severely disadvantaged if he or 
she does not know the meaning of the target word to 
be repeated.> On the basis of some prior informal 
testing of a few children who are considered “star” 
CI users, we believe that in situations in which the 
child has reason to expect an unfamiliar word, the 
disadvantage may be somewhat overestimated for 
children with considerable implant experience. The 
actual linguistic behavior of these childrén suggests 
that they possess the ability, like normal-hearing-chil- 
dren, to reproduce unfamiliar but “word-like” tar- 
gets by using their knowledge of the phonological 
patterns present in their ambient language. 


The nonword repetition task has attracted a great 
deal of interest over the past decade among research- 
ers in language acquisition and developmental psy- 
chology because the performance of normal-hearing 
children on this task has been found to be correlated 
with individual differences in real-word vocabulary 
acquisition.”6 Many studies of special populations 
known to experience problems with language or other 
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cognitive processing skills have subsequently been 
carried out with this procedure.’ As a result of re- 
viewing this research, we became interested in the 
nature of the nonword repetition task and its rela- 
tionship to spoken language development. 


The present study was designed to answer the fol- 
lowing questions. First, how much variability is ob- 
served on the nonword repetition task in this clinical 
population? Second, are individual differences among 
children in each of the component processes of speech 
perception, working memory, and speech production 
reflected in nonword repetition performance? Finally, 
what specific phonological changes or distortions are 
evident in the imitation responses, and how are these 
errors distributed among the children? 


METHODS 


Participants. The 14 children who participated in 
this research were recruited as part of a larger study 
carried out at Central Institute for the Deaf in St 
Louis, Missouri.® All of the participants were 8 or 9 
years of age. Eleven of the children were reported to 
be congenitally deaf. The remaining 3 became deaf 
before 2 years of age. The average duration of deaf- 
ness before implantation was approximately 3 years 
(range, | year 6 months to 5 years 3 months). All of 
the children had used their implants for at least 3.5 
years before the present testing (mean use, 5 years 6 
months). Both oral and total communication children 
were represented in the group. All of the children 
used the Nucleus-22 device and the spectral peak 
(SPEAK) coding strategy. 


Stimulus Materials and Procedure. The children 
were tested with a subset of 20 nonwords selected 
from the 40-item Children’s Test of Nonword Rep- 
etition (CNRep).? The CNRep was originally devel- 
oped to measure individual differences in phonologi- 
cal working memory in young children with normal 
hearing. All items on the CNRep are phonotactically 
legal sequences in English. Although there are some 
disadvantages to using this particular test, it was used 
in the present study because of the existence of a 
large body of previously published data obtained with 
these stimulus materials. 


The subset of 20 nonwords was chosen by elimi- 
nating the 20 items that showed the least amount of 
variance in scores obtained previously in our labora- 
tory from 5-year-old children with normal hearing. !° 
We also eliminated some nonwords that were essen- 
tially common real words attached in an unfamiliar 
manner to a standard affix. Five nonwords remained 
at each of 4 word lengths: 2, 3, 4, and 5 syllables. 
Because the existing recordings of the CNRep were 
spoken in British English, the 20 target stimuli for 


the present study were re-recorded in the voice of a 
female adult speaker of American English. !° 


To elicit the children’s imitations, we played each 
target item via a high-quality loudspeaker at approxi- 
mately 70 dB sound pressure level. In a few cases, 
the level was increased at the child’s request. The 
child was asked to “repeat each silly, made-up word” 
back to the examiner. A head-mounted microphone 
was used to record all responses onto digital audio- 
tape. These audio recordings were later segmented 
with a digital waveform editor into individual sound 
files for use in the perceptual study. 


Although 43 children were initially tested with this 
task, only 14 children provided responses to all 20 
tokens. This outcome should not be interpreted to 
mean that most of the children were unable to carry 
out the task. In actuality, 88% of all children tested 
produced a response to 15 or more of the 20 non- 
words. Missing tokens usually resulted from the 
child’s failing to respond on a given trial. In a few 
cases, tokens were eliminated from the final data 
analysis because of problems with the recording pro- 
cedure. This subselection of participants did, how- 
ever, cause the remaining sample of children to con- 
tain a somewhat greater proportion of better-perform- 
ing children than existed in the overall group of 43. 
Even so, as will be shown below, the children in the 
smaller group varied quite widely on many of the 
measures reported. 


In the past, nonword repetition performance has 
usually been quantified with a binary scoring proce- 
dure crediting the child either 1 point or no points 
for each target item. Any error, even of only a single 
segment (phoneme), usually resulted in no credit, al- 
though some provisions were made for predictable 
patterns of immature articulation in very young chil- 
dren.?3 This scoring procedure was not suitable for 
use in the present study because the children with 
CIs frequently made multiple segmental errors. More- 
over, because we were specifically interested in ex- 
amining the nature of the children’s errors, a more 
sensitive measure of nonword repetition accuracy 
was needed. 


In the present article, we report results of two dif- 
ferent assessment methods: one based on a behav- 
ioral perceptual measure, and the other derived from 
a partial linguistic analysis of the children’s produc- 
tions. For the perceptual measure, we used ratings 
of repetition accuracy gathered from 10 monolingual 
English-speaking adult listeners with normal hear- 
ing who reported minimal to no experience with the 
speech of deaf or hearing-impaired persons. The per- 
ceptual ratings were obtained in the following man- 
ner. On each of 280 randomized trials, the listener 
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Number of Children 





Otol Lto2 2to3 3to4 4to5 Sto6 


Totally fails Perfect 
+ 
to resemble the rendering of the 


target utterance target utterance 


Frequency histogram illustrates distribution of percep- 
tual nonword repetition accuracy ratings averaged across 
items and listeners. 


heard a target nonword stimulus, followed after 1 
second of silence by a child’s imitation response. The 
listener then rated the child’s imitation response on 
a 7-point scale with end point labels of 0 (“totally 
fails to resemble the ‘target’ utterance”) and 6 (“per- 
fectly accurate rendering of the ‘target’ utterance, ig- 
noring differences in pitch”). 


A partial linguistic assessment of the children’s 
imitations was also carried out. In the present article, 
we report results based only on our analyses of the 
accuracy of the initial consonant and the number of 
syllables produced in each imitation response. In 
order to complete these analyses, every imitation was 
listened to and transcribed by the second author. All 
imitations were also transcribed by the first author. 
There was 92% agreement on the transcription of 
the initial consonants and 85% agreement on num- 
ber of syllables produced. When in conflict, the sec- 
ond author’s transcriptions were used. 


Because 3 target items began with an initial vowel 
or liquid, the analysis of the initial consonants in- 
volved only the 17 target nonwords with initial ob- 
struents. These 17 nonwords included the following 
initial target consonants: /p/ in 3 instances, /b/ in 3 
instances, /t/ in 1 instance, /d/ in 2 instances, /k/ in 2 
instances, /g/ in 1 instance, /f/ in 1 instance, /v/ in 2 
instances, and /s/ in 2 instances. This set therefore 
included 2 different manners of articulation (stop and 
fricative), consonants from each of the 3 gross places 
of articulation (labial, coronal, and dorsal), and both 
voiced and voiceless consonants. 


Each imitation response was scored in 4 different 
ways on the basis of its initial consonant. The first 
method was a segment-based accuracy score. An 


imitation was counted as correct and given | point if 
the initial consonant was correctly reproduced. No 
partial credit was assigned for any incorrect produc- 
tions. Each of the 3 remaining scoring methods re- 
flected the accuracy of a specific phonological fea- 
ture of the initial consonant: the place feature (la- 
bial, coronal, dorsal), the manner feature (stop, fric- 
ative), or the voicing feature (voiced, voiceless). In 
these score assignments, | point was given to a child’s 
response if it was a correct imitation simply in terms 
of the feature in question. For example, in scoring 
place of articulation, if the target segment was a la- 
bial such as /p/, then a point was given for any imita- 
tion that began with a labial sound (ie, any response 
beginning with a /p/, /b/, /f/, /v/, or nonphonemic la- 
bial sound was scored as correct). 


In one final analysis, each imitation response was 
also scored for the number of syllables produced. 
For this analysis, responses to all 20 of the target 
nonwords were examined and scored. 


RESULTS 


Perceptual Results. To answer our first question 
regarding how much variability among children we 
would observe on the nonword repetition task, we 
examined the group distribution of the perceptual 
ratings averaged across all items and listeners. As 
illustrated in the Figure, the group was not at floor 
on this task, and most of the children were able to 
produce approximate imitations of at least some of 
the target nonwords. The mean score of all 14 chil- 
dren was 2.26 on the rating scale of 0 to 6 (SD, 0.93 
units). Although this sample of 14 children did con- 
tain a larger proportion of better-performing CI us- 
ers than was present in the larger sample of 43 chil- 
dren, these results suggest that some pediatric Cl 
users are able, like young children with normal hear- 
ing, to partially imitate unfamiliar sound sequences 
for which they have no lexical representation. 


The inter-rater reliability across our 10 naive lis- 
teners was very high (a > .90). Every listener’s av- 
erage perceptual rating (across the 20 items) for in- 
dividual children was correlated with an r value of 
.89 or higher with every other rater’s average rating 
for those same children (mean correlation, .94). These 
values indicate that our naive listeners were strongly 
in agreement regarding the performance of individual 
children on this task. 


In order to answer our second question, whether 
individual differences in the children’s speech percep- 
tion, working memory, and speech production skills 
would be reflected in their success at the nonword 
repetition task, we calculated a series of simple cor- 
relations between the nonword repetition scores and 
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TABLE 1. CORRELATIONS BETWEEN NONWORD REPETITION AND SPEECH-LANGUAGE MEASURES (N = 14) 








Initial Consonant 
Accuracy 


Perceptual Rating 
(Whole Item) 








Simple Simple 
Bivariate Partial Bivariate Partial 
r re r rm 
Encoding or perception 
Word identification, closed set, pointing response 
Word Intelligibility by Picture Identification (WIPI)!! 
(group mean + SD, 69.1% + 12.4%) 69% * 190% 44 42 
Word identification, open set, spoken repetition 
Lexical Neighborhood Test, Easy Words (LNTe)5 
(group mean + SD, 68.6% + 14.0%) 69*%* .61* .63* 59* 
Comprehension 
Test of Auditory Comprehension of Language—Revised? (TACL-R)!2 
(group mean + SD, 7.28 + 1.29 y) .60* 49 .65* 58 
Memory 
Phonological working memory, spoken repetition 
Wechsler Intelligence Scale for Children, Auditory Digit Span Subtest, 
Forward Recall of Digit-Name Lists!? (group mean + SD, 5.7 + 1.9 points)  .76** W E a TE ia 69% 
Production 
Speech intelligibility 
McGarr Sentences Intelligibility'* (group mean + SD, 79.6% + 16.8%) .10*#* BITR .68** TOWE 
Articulated sentence duration 
McGarr Average Duration of 7-Syllable Sentences!4 
(group mean + SD, 2.88 + 0.86 s) —.70** =.56 —.64* —.53 


Correlation between perceptual rating and initial consonant accuracy, r = .86. “Partial r” means age at onset and duration of deafness are 


partialled out. 
*p <.05. 
tke 1) 
+Administered via total communication, age-equivalent score. 





supplementary speech, language, and memory data 
gathered for another project by clinicians at Central 
Institute for the Deaf within a few days of the non- 
word repetition recordings.” These supplementary 
measures are briefly summarized in Table 1.511-14 
Nonword repetition performance was quantified in 
these calculations as either the mean perceptual rat- 
ing assigned to each child’s productions averaged 
over items and listeners (Table 1, at left) or the tran- 
scription-based initial-consonant accuracy score for 
each child averaged over items (Table 1, at right). 


The simple bivariate correlations shown in Table 


TABLE 2. CORRELATIONS BETWEEN NONWORD 
REPETITION PERFORMANCE AND DEMOGRAPHIC 
VARIABLES (N = 14) 








Initial 
Perceptual Consonant 
Rating (r) Accuracy (r) 





Age at test —14 05 
Age at onset of deafness 50 39 
Duration of deafness +58* —.41 
Duration of cochlear implant use 31 19 
Degree of exposure to oral-only 

communication environments ll _ 209 
No. of active electrodes =02 = O07 

*p <.05. 





l indicate that both measures of nonword repetition 
performance, mean perceptual rating and initial con- 
sonant accuracy, were strongly positively correlated 
with measures of word recognition, receptive lan- 
guage comprehension, auditory digit span, and speech 
intelligibility in meaningful sentences. In addition, 
we found that children who tended to speak more 
slowly when encouraged to produce their “best” spo- 
ken language tended to do less well on the nonword 
repetition task. 


Because we were concerned about the possible in- 
fluence of intervening variables on the size of the 
above relationships, we also calculated the simple 
correlations between a number of demographic vari- 
ables and the nonword repetition scores. These are 
shown in Table 2. In general, the only relationship 
of note was a sizable negative correlation with dura- 
tion of deafness, indicating that children who had 
been deaf for a longer period of time before implan- 
tation tended to do more poorly on the nonword rep- 
etition task. Moderate but nonsignificant positive cor- 
relations were also obtained between nonword rep- 
etition performance and age at onset of deafness (al- 
though there was very little variability in this demo- 
graphic variable within the sample). We therefore re- 
calculated the correlations between nonword repeti- 
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tion performance and word recognition, receptive lan- 
guage comprehension, auditory digit span, speech in- 
telligibility, and sentence duration with the co-vary- 
ing demographic factors of age at onset of deafness 
and duration of deafness statistically partialled out. 
As shown in Table 1, partialling out these demograph- 
ic variables, however, had very little effect on the 
size of the observed correlations. 


Although not shown in Table 1, we also examined 
whether the correlations between nonword repetition 
performance and each of the three component pro- 
cesses (ie, speech perception, working memory, and 
speech production) would shrink in size if represen- 
tative measures of the other two processing compo- 
nents were statistically partialled out. In these analy- 
ses, we found that the correlations between nonword 
repetition performance and working memory were 
the least affected by the partialling out of represen- 
tative outcome measures from the other two process- 
ing components. That is, the correlations between 
nonword repetition performance and digit span re- 
mained strong even when individual differences in 
word identification and speech intelligibility were 
statistically partialled out. This result suggests that 
the component process of working memory contrib- 
utes more unique variance to the observed individual 
differences in nonword repetition scores than do the 
speech perception and speech production components 
of the task. This latter finding should be viewed as 
preliminary in nature, given our very small sample 
size at the present time, but the general pattern is 
consistent with results obtained in previous studies 
with normally developing children.* 


Linguistic Analyses. The children’s imitations were 
next examined for generalizable patterns of phono- 
logical changes or distortions. The first linguistic 
analysis focused on the nature of these errors and 
how the errors were distributed among the children. 
The overall accuracy of the initial consonant aver- 
aged across children was 39%. The children’s scores 
were quite variable, ranging from 0% to 76% cor- 
rect. The voicing feature of the initial consonant was 
correctly imitated in an average of 67% of the non- 
word productions, with individual subject scores 
ranging between 29% and 88%. The manner feature 
of the initial consonant was accurately imitated in 
an average of 64% of the nonword repetitions, with 
scores ranging from 35% to 76%. Only 4 subjects 
scored at or below 60% on this measure. On aver- 
age, the children correctly imitated place in 59% of 
the nonwords; the scores ranged from 35% to 88%, 
with a relatively flat distribution between these ex- 
tremes. To summarize, voicing was imitated correctly 
most often, followed by manner, and then by place 
of articulation. The distributions of manner and voic- 


ing scores across the 14 children were more skewed 
in favor of correct imitation than was the distribu- 
tion of place scores. 


Reanalyzing these results in terms of the initial con- 
sonant of the target nonwords, we found that over- 
all, stops were imitated correctly more often than fric- 
atives. However, the 3 most accurately imitated word- 
initial segments were /t, d, s/, a set composed of 2 
coronal stops and a coronal fricative. The next 4 most 
accurately imitated word-initial segments were the 
noncoronal stops /p, b, k, g/, which were followed 
by the noncoronal fricatives /f, v/. This distribution 
indicates that targets that were coronal in terms of 
place were easier for the children to produce than 
targets that were labials or velars, regardless of man- 
ner. That is, coronal fricatives were produced, on av- 
erage, more accurately than labial and dorsal stops. 
This particular finding is not consistent with previ- 
ous reports in the literature suggesting that stops are 
more often correctly produced than fricatives or that 
labials are generally more often correctly produced 
than other consonants.!> The differences observed 
in this study may be a result of the inability of our 
participants to rely on visual cues to labial articula- 
tion because all of the nonwords were presented in 
an auditory-only format. Earlier reports that show 
better production of labial consonants may not re- 
flect pediatric CI users’ accurate auditory perception 
of labial sounds per se, but may instead reflect the 
availability of visual cues when audiovisual presen- 
tation methods are used in testing.!® 


Examination of the syllable production scores 
showed that overall, the children produced the cor- 
rect number of syllables in 66% of the imitations. 
The individual subject scores ranged from 30% to 
95% of responses produced with the correct number 
of syllables. When the imitations did not have the 
correct number of syllables’ the children tended to 
produce fewer syllables than were in the target non- 
word. Although the results for 1 particular test item 
inflated the error rate for 2-syllable nonwords, the 
children’s imitation of the number of syllables in each 
target tended to be correct more often for nonwords 
with fewer syllables. Specifically, 76% of the 3-syl- 
lable targets, 74% of the 2-syllable targets, 66% of 
the 4-syllable targets, and 49% of the 5-syllable tar- 
gets were imitated with the correct number of sylla- 
bles. The perceptual ratings of the nonword responses 
mirrored this syllable length effect. We found a reli- 
able negative correlation between the length of the 
intended target pattern and the perceptual rating for 
that nonword averaged across children (r = —.62; p < 
.01). Finally, the perceptual ratings were also corre- 
lated with the syllable production scores (r = .67; p 
< .01) — a finding suggesting that listeners gave 
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higher ratings to nonword imitations that preserved 
the correct number of syllables. 


DISCUSSION 


The present results indicate that these experienced 
pediatric CI users are able to use their existing lin- 
guistic knowledge to decode novel sound sequences. 
Within the sample studied, individual differences in 
the component processes of encoding, working mem- 
ory, and speech production were strongly reflected 
in nonword repetition scores, regardless of whether 
these scores were derived from a behaviorally based 
perceptual ratings measure using naive listeners or 
from a more formal transcription-based linguistic 
analysis of their responses. 


We are currently trying to identify the factors that 
naive listeners weight most heavily when asked to 
rate the accuracy of children’s nonword imitations. 
As evident in the above results, preserving the same 
number of syllables as present in the target stimulus 
is one key factor — a condition usually fulfilled by 
the speech productions of the children in this study. 
Detrimental to perceived accuracy are feature-based 
segmental errors such as those described for the chil- 
dren’s initial consonants. In producing initial conso- 
nants, the pediatric CI users were less consistent in 
maintaining the 3-way place contrast than the 2-way 





manner and voicing contrasts. Our linguistic analy- 
sis suggests that in the future it may be better to use 
amore phonologically balanced set of nonword stim- 
uli in order to investigate the favored production of 
coronal consonants over other places of articulation 
and the impact of the absence of visual cues on the 
perception of labial consonants. 


In summary, the results of this imitation study us- 
ing the nonword repetition task demonstrate that some 
children with CIs are, like normal-hearing children, 
able to use their knowledge about the phonological 
patterns present in their ambient language to repro- 
duce novel sound patterns. These children show evi- 
dence of being able to “decompose” nonsense words 
into familiar parts (ie, phonemes, phonetic segments), 
and rapidly “reassemble” or “translate” these elements 
into recognizable motor outputs despite the absence 
of learned lexical representations. This work is still 
preliminary and will require larger sample sizes be- 
fore more specific claims can be made. However, in 
light of these initial results, we suggest that it may be 
worthwhile to further investigate whether individual 
differences in the phonological processing skills as- 
sessed by nonword repetition can serve to predict, in 
part, real-word spoken vocabulary acquisition and 
language development in hearing-impaired children 
with CIs. 
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Speech perception outcomes for early-deafened children who undergo implantation as teenagers or young adults are generally 
reported to be poorer than results for young children, It is important to provide appropriate expectations when counseling adolescents 
and their families to help them make an informed choice regarding cochlear implant surgery. The considerable variation of results in 
this group makes this process more difficult. This study considered a number of factors in a group of 25 children who underwent 
implantation in Melbourne between the ages of.8 and 18 years. Each subject completed open-set speech perception testing with 
Bamford-Kowal-Bench sentences before and after implantation and preoperative language testing with the Peabody Picture Vocabu- 
lary Test. Data were collected regarding the type of hearing loss, age at implantation, age at hearing aid fitting, audiometric details, 
and preoperative and postoperative communication mode. Results were submitted to a stepwise multiple linear regression analysis 
with postoperative open-set sentence scores as the dependent variables. The analysis suggested that 3 factors have a significant 
predictive value for speech perception after implantation: preoperative open-set sentence score, duration of profound hearing loss, 
and equivalent language age. These 3 factors accounted for 66% of the variance in this group. The results of this study suggest that 
children who have useful speech perception before implantation, and higher age-equivalent scores on language measures, would be 
expected to do well with a cochlear implant. Consistent with other studies, a shorter duration of profound hearing loss is also 
advantageous. The mean sentence score for this group, 47%, was not significantly different from the mean result across all children 


in the Melbourne program. 


KEY WORDS — child, cochlear implant, predictive factor, speech perception. 


INTRODUCTION 


Many studies of the outcomes of cochlear implan- 
tation in children have identified the age of the child 
at the time of surgery as an important factor.!-7 It is 
generally accepted that younger children have an im- 
proved chance of speech perception, speech produc- 
tion, and language benefits from the application of 
implant technology. This rationale is often related to 
the concept of critical time periods for the develop- 
ment of spoken language, although the actual age 
intervals may vary for different skills and for differ- 
ent children.’ Despite acceptance that age may play 
a crucial role in the effective use of a cochlear im- 
plant, the wide range of results and other factors in- 
volved have sometimes tended to obscure this effect 
in clinical studies. 


A number of recent studies have identified age at 
implantation or duration of profound deafness (which 
amounts to the same thing for congenitally deaf chil- 
dren) as having a significant relationship to speech 
perception outcomes.!-? In general, children with 
congenital profound hearing impairment who under- 
go implantation as teenagers have not demonstrated 
the open-set speech perception abilities seen in many 





younger children who use cochlear implants. On the 
other hand, there have been cases of excellent speech 
perception results for older children.? Notably, chil- 
dren with progressive hearing loss who undergo im- 
plantation at a stage when the hearing loss has reached 
a profound level often perform well on speech per- 
ception tasks.!° It has been suggested that the early 
use of residual hearing by such children may pro- 
vide them with an advantage in processing auditory 
information from a cochlear implant. 


The change in the clinical population that is con- 
sidered for cochlear implantation may also be rele- 
vant to the effects of age on results. Because of im- 
proving outcomes, many children with a greater 
amount of residual hearing are being considered for 
implants. These children may have relatively good 
language and auditory processing skills in place, par- 


- ticularly as compared to children with profound-to- 


total congenital hearing loss. Their skills will depend 
not only on the level of residual hearing, but also on 
such factors as age of hearing aid fitting, level of 
family and educational support, and other individual 
attributes. 


For the reasons outlined above, it may be time to 
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SUBJECT DETAILS 
Variable Mean Range 
Age at implantation (y) 12.3 8.1-17.6 
Age aided (y) 2.2 0.7-7.0 
Duration of profound loss (y) 8.8 0.4-17.6 
Experience with cochlear implant (y) 2.7 0.5-7.5 
Pure tone average (best ear; dB) 112 82-130 





revisit the issues relating to the implantation of older 
children and attempt to identify some of the factors 
that could help in predicting outcomes. This paper 
will consider speech perception outcomes in particu- 
lar, although it is accepted that the effects of implan- 
tation on language and speech production are of at 
least comparable importance. 


METHODS 


Subjects. The study considered the speech percep- 
tion performance of 25 children who underwent im- 
plantation with the Nucleus multiple-channel cochlear 
prosthesis in Melbourne between 1988 and 1998. 
Their ages ranged from 8 to 18 years at the time of 
surgery. This sample represented 17% of the 145 chil- 
dren who underwent implantation during this time 
in the Melbourne program. The Table summarizes 
various demographic details of the subjects. 


Data Collection. Speech perception was assessed 
for all 25 subjects with the Bamford-Kowal-Bench 
(BKB) sentence test!! presented in a free field at 70 
dB sound pressure level with auditory input alone. 
Each list of BKB sentences comprises 16 sentences 
with 3 or 4 key scoring words per sentence. The score 
(out of 50) represents the number of key words cor- 
rectly identified. Responses were either spoken or 
written by the subjects. When speech production 
could have affected the interpretation of responses, 
the test sessions were videorecorded and the re- 
sponses were scored by 2 independent auciologists 
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Fig 1. Postoperative scores for Bamford-Kowa -Bench 
(BKB) open-set sentences test for 25 older children with 


multiple-channel cochlear implants in audition-alone 
condition. 
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Fig 2. Postoperative BKB open-set sentence scores for 
25 older children with multiple-channel cochlear im- 
plants. grouped according to preoperative scores. 


or speech pathologists. Subjects who used hearing 
aids, as well as their cochlear implant, had their hear- 
ing aids switched off during testing. Most of the sub- 
jects had a number of postoperative assessments, but 
only the most recent assessment (until September 
1999) was used in this study. All subjects had at least 
12 months of experience with the cochlear implant 
system. All subjects used the spectral peak (SPEAK) 
speech processing strategy! in a Spectra, Sprint, or 
Esprit processor (their take-home processor). The 
programming of the subjects’ speech processors was 
checked with standard clinical techniques before 
speech perception testing. All subjects used at least 
15 electrode combinations or channels in their speech 
processor MAPs. 


In addition to the recent speech perception scores, 
data were obtained from the subjects’ clinical files, 
including age at implantation, type of hearing loss 
(congenital profound, meningitis, progressive), age 
at hearing aid fitting, 3-frequency pure tone average 
hearing loss in the better ear at time of surgery, pre- 
implantation speech perception scores for open-set 
BKB sentences with hearing aids, the duration of pro- 
found hearing loss before implantation, the preim- 
plantation communication mode (exclusively oral or 
using some signing), and the preimplantation equiva- 
lent language age as estimated by the Peabody Pic- 
ture Vocabulary Test (PPVT).!3 


Data Analysis. A stepwise linear regression analy- 
sis was carried out with the postoperative open-set 
sentence scores as the dependent variable and other 
information, as detailed above, as possible predictor 
variables. Because of the large number of predictor 
variables, a correlation analysis was used initially to 
indicate those predictor variables that were signifi- 
cantly correlated with the postoperative speech per- 
ception scores. Only these variables were used in the 
multiple regression analysis. 


RESULTS 


The open-set sentence scores for each of the sub- 
jects are shown in Fig 1 along with the overall mean 
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Fig 3. Postoperative BKB open-set sentence scores for 
25 older children with multiple-channel cochlear im- 
plants, grouped according to duration of profound hear- . 
ing loss before implantation. 


for the group. Inspection’ of these data shows: that 
there are approximately equal numbers of subjects 
who score poorly (<10%) and subjects who score 


` very well (>80%), with very few scores in the middle 


range. This distribution leads to a rather misleading 
mean score of 47%. 


The stepwise multiple linear regression analysis 
suggested that 3 predictor variables accounted for 
significant amounts of variance in the postoperative 
BKB sentence scores. The open-set speech percep- 
tion before implantation had a positive association 
with the results after implantation. This relationship 
is illustrated in Fig 2, in which the subjects are 
grouped according to their preimplantation speech 
perception scores. 


The duration of profound hearing loss before im- 
plantation had a negative association with postopera- 
tive speech perception. This effect is illustrated in 
Fig 3, in which the subjects are grouped according 
to their duration of profound hearing loss. 


The equivalent language (vocabulary) age as mea- 
sured by the PPVT was positively associated with 
postoperative speech perception scores. This effect 
is illustrated in Fig 4, in which the subjects are 
grouped according to equivalent language age at the 
time of surgery. Note that Figs 2-4 do not necessar- 
ily provide an accurate description of the effect of 
the predictor variables, as interactions among these 
variables may enhance or decrease the apparent ef- 
fects in a 2-dimensional representation. For instance, 


Fig 5. Distribution of BKB open-set sentence scores 
for all children who underwent implantation in Mel- 
bourne and who had completed this assessment. 
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Fig 4. Postoperative BKB open-set sentencés scores for 
25 older children with multiple-channel cochlear im- 
plants, grouped according to preoperative equivalent lan- 
guage age. PPVT — Peabody Picture Vocabulary Test. 


it is possible that subjects with better speech percep- 
tion before implantation also have better language 
abilities. The correlations among the 3 -significant 
predictor variables did not reach a significant level, 
but it is likely that such interactions do exist in real- 
ity. Despite these qualifications, the trends illustrated 
in the Figures correspond in direction with the re- 
sults of the multiple regression analysis, and the 
analysis suggests that each of the variables has an 
independent effect on speech perception outcomes. 
The significant predictors accounted for 66% of the 
variance in the speech perception scores in this sam- 
ple. gis 


It is worth noting some of the variables that did 
not show any significant association with open-set 
speech perception. These included the hearing lev- 
els before implantation, as measured by the pure tone 
average in the better ear; the communication mode 
before implantation, with the subjects divided into 
those who used oral communication exclusively, and 
those who used some sign language; the type of hear- 
ing loss (congenital, meningitis, or progressive), and 
the age at hearing aid fitting. These variables have 
been identified as having significant associations with 
speech perception after implantation in previous stud- 
ies of younger groups. In this study, however, these 
particular measures did not show significant effects 
on speech perception results. 


DISCUSSION 
Comparison With All Children With Implants. This 
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investigation of a small group of older children with 
cochlear implants shows that speech perception out- 
comes can be poor in some cases, but excellent re- 
sults can be obtained for many others. The distribu- 
tion of BKB sentence scores for all children in the 
Melbourne program is shown in Fig 5. Note that not 
all children with implants had reached the age at 
which this assessment was appropriate. The mean 
score was 56%, which is not significantly different 
from the result for the group of older children. It does 
not appear to be valid to estimate the prognosis for a 
child cochlear implant candidate on the basis of age 
alone. 


Preoperative Hearing and Speech Perception. The 
most significant effect found in this study was the 
predictive effect of preoperative speech perception 
scores on postoperative scores. Children who scored 
above 20% for BKB sentences before operation tended 
to have speech perception scores close to 100% when 
using their implant system (Fig 2). On the other hand, 
the audiometric thresholds were not predictive of post- 
operative performance in this group. It seems that 
the ability to use minimal auditory information from 
hearing aids for speech perception carries over to ef- 
fective use of the implant. This ability is likely to be 
affected by many aspects of the child’s hearing loss, 
aid fitting, education, and cognitive abilities, and is 
not necessarily dependent on the preoperative audio- 
gram alone. 


Duration of Profound Hearing Loss. The duration 
of profound hearing loss has been identified in stud- 
ies of both adults and children to be associated with 
speech perception results with cochlear implants. In 
this study, too, we see a link between this parameter 
and the sentence results (Fig 3). In the extreme, this 
result reflects the differences between a teenager who 
has a congenital profound or total hearing loss and a 
child of a similar age who becomes deaf suddenly 
after having normal hearing and education for their 
early life. Between these extremes are children who 
may have had some progression in hearing loss over 
time such that their early opportunities for auditory 
learning were more favorable than those of others. 
The result supports the hypothesis that auditory dep- 
rivation can lead to detrimental effects on speech per- 
ception performance. It also is consistent with the 
idea that the development of language through audi- 
tion in early life will enhance the later use of audi- 


tory information from cochlear implants. 


Language Development. Vocabulary development 
as measured by the PPVT was related to speech per- 
ception outcomes in this study. Because the speech 
perception test used involved connected sentences, 
various aspects of language knowledge are bound to 
come into play in completing the task. These include 
both semantic and syntactic knowledge. Figure 4 in- 
dicates that children with equivalent language ages 
below 7.5 years tend to struggle with this sentence 
perception task. The result probably also reflects a 
general relationship between language development 
and speech perception ability, as demonstrated by 
Blamey et al!4 on a larger group of hearing-impaired 
children. Other studies have identified the mode of 
communication for children with cochlear implants 
as having a significant relationship with speech per- 
ception abilities,!:!° but this study did not indicate 
that the communication mode was a relevant factor 
for these older children. Vocabulary development ob- 
tained through the use of oral or sign language com- 
munication appeared to be sufficient to assist speech 
perception. 


CONCLUSIONS 


The results of this investigation suggest that for 
older children considering cochlear implantation, it 
is important to assess language development and the 
current use of residual hearing for speech percep- 
tion, and to have a detailed history of the progres- 
sion of the hearing loss. These factors can have sig- 
nificant effects on speech perception outcomes, and 
the information may help to provide more useful 
counseling to candidates and their families. The es- 
timation of outcomes in the preimplantation phase 
for this group is particularly important, as results do 
vary Over an extreme range. 


This study provides support for the widening of 
indications for cochlear implantation, in terms of de- 
gree of hearing loss. On the other hand, the consid- 
eration of children with useful residual hearing brings 
with it an obligation to assess perceptual performance 
with hearing aids in a rigorous manner. In addition 
to standard testing, an evaluation that considers each 
ear separately and considers the binaural advantage 
for speech perception in both quiet and background 
noise appears necessary for children with useful aided 
hearing in both ears. 
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INFLUENCE OF AGE AT IMPLANTATION AND OF RESIDUAL HEARING 
ON SPEECH OUTCOME MEASURES AFTER COCHLEAR 
IMPLANTATION: BINARY PARTITIONING ANALYSIS 


HAMDY EL-HAKIM, MD 
MOHAMED ABDOLELL, MSC RICHARD J. MOUNT, MLT 


BLAKE C. PAPSIN, MD, FRCSC ROBERT V. HARRISON, PHD, DSC 
TORONTO, CANADA 


The goal of this study was to evaluate speech understanding outcomes in prelingually deaf children who use a cochlear implant 
device. Specifically, we discuss investigations on 2 prognostic variables — age at implantation and degree of residual hearing — and 
use a novel method: binary partitioning analysis. Our outcome measures are standard speech perception evaluations, including the 
Word Intelligibility by Picture Identification (WIPI) test, the Phonetically Ba anced—Kindergarten (PBK) test, and the Glendonald 
Auditory Screening Procedure (GASP). Regarding age at implantation, we definitely showed that growth rates of speech understand- 
ing do relate to age at implantation, but not in a simple fashion. We used binary partitioning in an attempt to find the age at implan- 
tation that best separates the performances of children with younger versus olcer ages at implantation. We found that there is no one 
“critical age”; much appears to depend on the nature and difficulty (eg, whether open- or closed-set) of the test used. Regarding 
residual hearing, binary partitioning analysis was unable to show that the amcunt of residual hearing (as shown by preimplantation 
audiometric data) has any significant bearing on speech outcome measures in congenitally or prelingually deaf children. 


KEY WORDS — age at implantation, binary partitioning, cochlear implant, congenital hearing loss, critical period, outcome 
measure, prelingual deafness, residual hearing, speech perception. 


INTRODUCTION 


There remains considerable interest in identify- 
ing prognostic variables that have an impact on func- 
tional outcomes of cochlear implantation in children. 
Such information is potentially useful for refining 
candidacy criteria and for improved counseling of 
parents and patients. Numerous investigations of co- 
chlear implant outcomes in prelingually deaf chil- 
dren have studied factors such as communication 
mode, age at implantation, age at onset of deafness, 
and processor type. 


The scientific question in these contexts is most 
often concerned with confirming (or refuting) associ- 
ations between the variable(s) and the outcome mea- 
sure(s) chosen. The nature of such a question lends 
itself to standard multivariate analysis methods, de- 
spite the constraints related to sample size, incom- 
plete data, or changing confounding variables over 
time. Other studies have tried to identify thresholds 
or cutoff points of certain variables at which outcome 
may be altered to a significant degree. For this type 
of question, the available knowledge base cannot pro- 
pose an a priori hypothesis — a problem not infre- 
quently encountered in biomedical research, Although 


frequently overlooked in the medical literature and 
rarely used in cochlear implant outcomes research, 
there are statistical models that resolve the problem, 
for example, tree-structured methods for longitudinal 
data! and recursive partitioning methods. 


In the present study, we use a related technique, 
binary partitioning analysis, to explore the effects of 
2 variables on hearing outcomes in children with co- 
chlear implants, namely, age at implantation and re- 
sidual hearing. The outcome measures used were a 
range of standard speech understanding tests. This 
is a retrospective analysis of postimplantation speech 
outcomes. The binary partitioning algorithm? splits 
a set of longitudinal data (the serially collected re- 
sults of each speech test) into 2 mutually exclusive 
groups based on an optimal split of the continuous 
prognostic variable (in this case, age at implantation 
and residual hearing). Our aim was to search for any 
significant points (perhaps benefit-limiting thresh- 
olds) along the continuum of age at implantation and 
along the continuum of residual hearing. The study 
demonstrates the potential of the binary partitioning 
algorithm in analyzing cochlear implant outcome 
measures. 
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MATERIALS AND METHODS 


Patient Population. The children studied under- 
went implantation between 1988 and 1999 and were 
followed within the Cochlear Implant Program at the 
Hospital for Sick Children, Toronto, Canada.‘ At the 
time of analysis, information on 133 children was 
available; however, only prelingually deaf children 
who underwent the speech perception tests described 
below were considered for this exploratory retrospec- 
tive study. Of these, we included only the patients 
who had performed the tests 2 or more times after 
implantation. Preimplantation test scores are not in- 
cluded in this analysis. 


The characteristics of the experimental group were 
as follows. Ideally, we would have all children per- 
form all of the tests an equal number of times; how- 
ever, because of practical constraints, this has not 
been achieved. The numbers of subjects for each 
speech test used were as follows: the Glendonald Au- 
ditory Screening Procedure (GASP) word test, 82 
patients; the GASP sentence test, 50 patients; the Pho- 
netically Balanced—Kindergarten (PBK) phoneme 
test and PBK word test, 69 patients; and.the Word 
Intelligibility by Picture Identification (WIPI) test, 
58 patients. The following describes the population 
for the largest group (GASP word test; n = 82). The 
age of patients at implantation ranged from 1.9 to 
15.4 years (mean, 6.3 years). There was an equal 
(41:41) gender distribution. Follow-up periods ranged 
from 6 months to 5 years (mean, 2.6 years). Most 
children (n = 69) used oral communication before 
implantation, and 13 used total communication. Re- 
garding onset of hearing loss, the group was relatively 
homogeneous. All children had a hearing loss be- 
fore 3 years of age; most hearing losses (n=57) could 
be classified as congenital, 16 hearing losses had 
some clear genetic origin, and 5 children had senso- 
rineural hearing loss resulting from meningitis. 


All children had severe to profound sensorineural 
hearing loss, with (for the implanted ear) average 
thresholds across all tested frequencies ranging from 
97 to 120 dB. In order to treat residual hearing as a 
continuous prognostic variable, we performed a trans- 
formation. For the implanted ear, the mean audio- 
metric pure tone threshold was determined at all test 
frequencies, and a percentage value was then calcu- 
lated by considering that a 120-dB loss (or immea- 
surable threshold) equates to 0% residual hearing. 
Thus, for example, a subject with an average thresh- 
old hearing loss of 100 dB across all frequencies 
would have “residual hearing” of 16.7%. l 


Cochlear Implant Device and Habilitation. The 
Nucleus multichannel cochlear implant (Cochlear 
Ltd; Lane Cove, Australia) was used in all children. 


All implants were programmed with the spectral peak 
(SPEAK) coding strategy as of 1994; earlier, the mul- 
tipeak (MPEAK) strategy was used. Thechildren were 
in various types of habilitation programs, as well as 
various educational placements, before and after im- 
plantation. For some of the children, there were 
changes in educational placement over the course of 
the years of study. 


Speech Tests. A closed-set test, the WIPI test as- 
sesses the ability to identify monosyllable words by 
audition alone. The test consists of 25 sets of col- 
ored pictures. Each set of 6 pictures elicits 4 words 
that rhyme and 2 others that are foils to decrease ele- 
vated scores as a result of ‘guessing. The child points 
to the correct response. 


For open-set speech perception tests, we used the 
PBK word list. This test evaluates the child’s ability 
to recognize single-syllable, phonetically balanced 
words (50 words). Half-lists of 25 words are used, 
and scores indicate the numbers of both phonemes 
(PBK phoneme) and words (PBK word) correctly 
identified. We also used the GASP, which consists 
of word and sentence subtests. The child has to re- 
peat 12 words and 10 questions. All responses in the 
tests are marked as percentages of the total score. 
The tests were administered to all children before 
implantation, then at 6-month intervals for 24 months 
after implantation, and annually thereafter. Note, how- 
ever, that the preimplantation test scores were not 
used in this study. 


Outcome Measures and Analysis. Age at implan- 
tation and residual hearing were each considered as 
prognostic variables, and the 5 speech perception tests 
as outcome variables. The following steps were per- 
formed for each. 


First, a binary partitioning algorithm was imple- 
mented that systematically demonstrated all possible 
binary splits of the data of each speech test. Thus, on 
the basis of age of implantation, or on the basis of 
degree of residual hearing, the splits attempt to parti- 
tion the patients into those who showed more or less 
progression of benefit in terms of improving score 
on the speech understanding tests. 


Second, a deviance (reduction) was calculated for 
every split. An “optimal” or best split is one that max- 
imizes the change in deviance achieved. In other 
words, the best split point reduces the heterogeneity 
of the data more than the other split points do (the 
goodness-of-split criterion). The. optimal split point 
can be validated by a permutation test that circum- 
vents the issue of multiple testing, ie, the repeated 
use of various statistical tests on a single data ‘set 
(which some statisticians find unacceptable). 
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Fig 1. Outcomes of Phonetically Balanced-Kindergarten word test after cochlear implantation. Left-hand plots — binary 
partitioning analysis with age at implantation as prognostic variable; right-hand plots — degree (in percent) of residual hear- 
ing as prognostic variable. Larger plots show drop in deviance for data splits according to age at implantation (abscissa, left 
graph) and on basis of percentage of residual hearing (right-hand plot). Small lower plots show graphical representations of 
repeated-measures analyses for partitioned data before (dotted curve) and after (continuous curve) divisions as indicated by 


dotted lines in main plots. 


Finally, we performed a repeated-measures analy- 
sis guided by the results of the binary partitioning 
analysis. Thus, to further illustrate the effect of the 
optimal splits, we used this more familiar analysis 
in selected examples with graphic representations. 
Statistical significance (p value) was set at the tradi- 
tional level of .05. It is important to note that this 
last step is not integral to the binary algorithm’s sta- 
tistical methodology, and the & value here is not taken 
as a measure of acceptance or rejection of the null 
hypothesis. 


RESULTS 


Figure | illustrates the application of binary parti- 
tioning analysis to the PBK word outcome measure 
(n = 69) for both of the prognostic variables under 
consideration. The left-hand panels show age-at-im- 
plantation data, and those on the right show residual- 
hearing data. The upper left panel plots the drop in 
deviance (vertical axis) for each split of the data based 
on age at implantation (horizontal axis). In this case, 
the best split, producing a drop in deviance of 85.2, 
is at the age of implantation of 8.4 years. Dotted ver- 
tical lines mark this best split and 2 other splits of 
the data, for ages at implantation of 4 and 10 years 
(chosen semirandomly at an age above and an age 
below the optimal age-at-implantation split). The low- 
er panels show repeated-measure analyses of vari- 
ance (ANOVA) corresponding to these 3 example 
splits of data. The continuous curves show the pro- 
gress (on PBK word test) over time after implanta- 


tion in children who underwent implantation before 
the age split of interest, compared with those who 
underwent implantation after it (dotted curve). The 
comparison of data split before and after 8.4 years 
(the best split of the data) clearly illustrates how sig- 
nificantly better (p < .001) the average performance 
is for children who underwent implantation at a 
younger age than for those who underwent implan- 
tation at a later age. 


By way of contrast, the right-hand panels show 
the binary partitioning analysis based on the degree 
of residual hearing. Here, there is no clear “best split” 
of the data. The drop in deviance for all splits is rela- 
tively low (in the range of 30 to 50). The lower pan- 
els show repeated-measures ANOVA of data split at 
residual hearing values of 4%, 11%, and 15% (cho- 
sen semirandomly to provide illustrative examples). 
Note how little disparity there is between data split 
at these residual hearing values. 


Age at Implantation. Figure 2 summarizes in graph- 
ical form the binary partitioning analysis for all of 
the speech tests used. In each panel, the drop in de- 
viance is plotted for all splits of data according to 
age at implantation. In the lower right-hand panel, 
we have superimposed data from all 5 test outcomes 
as indicated by the key. For each test, the data split 
that produces the maximum drop in deviance is in- 
dicated by an arrow. The inset plot shows graphically 
the performance of children who underwent implan- 
tation before (dotted line) and after (continuous line) 
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Fig 2. Graphical representations of binary parti- 80 
tioning analysis of age-at-implantation data. Drop- 
in-deviance plots for each speech outcome test are 
shown. Arrows indicate age-at-implantation split 
that produces maximal drop in deviance. Inset 
plots, based on repeated-measures analyses, illus- 
trate average postimplantation speech results for 
children who underwent implantation before (dot- 
ted curve) and after (continuous curve) age at im- 
plantation of interest. Lowest right-hand plot shows 
superimposed data. 0 


Drop in Deviance 
S 


Drop in Deviance 


the age-of-implantation split that was derived by a 
repeated-measures analysis. For all of the speech tests 
used, there is a significant difference (ANOVA) be- 
tween the performance of children who underwent 
implantation before the optimal split and that of chil- 
dren who underwent implantation after the optimal 
split. Table 1 summarizes the results, indicating the 
optimum age-at-implantation split, the drop in devi- 
ance, and the significance level of the difference in 
outcome for each of the tests used. 


It is important to note that we have thus far only 
considered data splits that produce the maximum drop 
in deviance; there may be other data splits according 
to age at implantation that also produce significant 
differences in performance (before versus after im- 
plantation). 


The pooled data of the lower right panel of Fig 2 
reveal some of the common elements between the 
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patterns of the “drop in deviance” plots for each of 
the tests. We note that the PBK word and PBK pho- - 
neme patterns are very similar, particularly in peak- 
ing at an implantation age of 8.4 years and in having 
some other common “peaks.” One “subpeak” at about 


TABLE 1. SUMMARY OF RESULTS OF USING AGE AT 
IMPLANTATION AS PROGNOSTIC VARIABLE 














Age-at- 

Implantation Drop in pon 
Test N  Split(y) Deviance ANOVA 
PBK word 69 8.4 84.8 <.05 
PBK phoneme 69 8.4 719.8 <.001 
GASP word 82 5.6 70.9 <.005 
GASP sentence 50 5.6 63.5 <.005 
WIPI 58 43 56.4 <.05 


ANOVA — analysis of variance, PBK — Phonetically Balanced- 
Kindergarten, GASP — Glendonald Auditory Screening Procedure, 
WIPI — Word Intelligibility by Picture Identification. 
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5.6 years corresponds to the maxima noted for GASP 
sentence and GASP word data, and is also a “‘sub- 
peak” for the WIPI data. All of the “drop in devi- 
ance” curves share a common “shoulder” at about 
3.5 years of age, which most likely reflects the bound- 
ary age before which children have difficulty in per- 
forming the tests. 


Residual Hearing. As shown in Fig 1 (right-hand 
panels), binary partitioning analysis based on the de- 
gree of residual hearing shows no obvious distinc- 
tions in speech understanding (PBK word test) for 
children who had different degrees of residual hear- 
ing before undergoing cochlear implantation. The data 
for all of the tests are graphically illustrated in Fig 3. 
Each panel plots the drop in deviance that appears 
when data are split according to percent residual hear- 
ing. In the lower right-hand panel, all plots are su- 
perimposed for comparison. In none of these plots is 


Residual Hearing (%) 


there a clear peak. As with the age-at-implantation 
data, we have indicated (arrow) the maximum drop- 
in-deviance value, and on the basis of these splits, 
we performed a repeated-measures analysis. The in- 
set graphs thus illustrate the average performance 
over time of children with less (dotted curve) and 
more (continuous line) residual hearing about the split 
value. In no case is there any significant difference 
(ANOVA) between the groups (p > .05). It can be 
noted that for the GASP sentence data (top right-hand 
panel), there is a gap in the average performance over 
time between the 2 groups. However, the rates of 
change are not different (the curves are more or less 
parallel), and statistically, there is no significant dif- 
ference. Indeed, there is no significant difference in 
the performances over time at any split based on re- 
sidual hearing. The data are summarized in Table 2. 


The superimposed plots (Fig 3, lower right) show 
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TABLE 2. SUMMARY OF RESULTS OF USING 
RESIDUAL HEARING AS PROGNOSTIC VARIABLE 

















Residual- 

Hearing Drop in pon 
Test N Split(%) Deviance ANOVA 
PBK word 69 11 51.2 >.05 
PBK phoneme 69 6 50.8 >.05 
GASP word 82 16 55.9 >.05 
GASP sentence 50 l1 59.0 >.05 
WIPI _ 58 5 51.4 >.05 





that there are no common features to the “drop in 
deviance” plots, except a “shoulder” at the lower end 
(approximately 4%) of the residual hearing scale. 


This split could divide the patients with essentially., 


no hearing from those with some degree of residual 
hearing. In any case, it does not distinguish 2 groups 
that are significantly different in terms of perfor- 
mance on the tests used. 


DISCUSSION 


Age at Implantation. The question of the effects 
of age at implantation on outcomes is one of the most 
widely researched issues in the pediatric cochlearim- 
plant field.>!8 For congenitally deaf children, con- 
ventional wisdom would advise us that age at im- 
plantation is an important prognostic variable. The 
biological bases are postulated.to be the influences 
of early periods of sensory deprivation, together with 
the possible existence of.“‘critical periods” in the es- 
tablishment of auditory mechanisms, including speech 
understanding and language development. 19:20 How- 
ever, many questions remain unanswered regarding 
the limits of any critical period, or whether several 
periods exist.?! 


Many prior investigations into age at implantation 
and its impact on speech outcomes used comparisons 
between groups of patients divided on the basis of 
an arbitrary age at implantation, or compared groups 
at the extremes of the age-at-implantation range: 
There are often no clear scientific assumptions to sup- 
port the choices of groups to compare. Our own co- 
chlear implant group has previously published out- 
come measure comparisons in which we-arbitrarily 
divided the patients according to whether they un- 
derwent implantation before or after 6 years.of age.?? 


The method chosen in this study allows the data 
to select their own best splits, and inherently circum- 
vents some of the concerns about missing data and 
about multiple testing. One of our important results 
is the appearance of a number of peaks on the drop- 
in-deviance plots, most obviously shown in the pooled 
data of Fig 2 ower right panel). Thus, the algorithm 
shows not only the optimal split of the data, but also 
other splits that might produce significant results on 


multiple statistical testing. One could surmise that 
multiple peaks occur on the different tests because 
different speech tests tap differently evolving aspects 
of speech understanding. In other words, different 
cutoff points measure different aspects of speech per- 
ception. Our results could explain previous findings 
by different authors of statistically significant results 
using different age-at-implantation cutoff points. The 
multiple thresholds that are seen could be a manifes- 
tationof the presence of multiple critical periods. The 
concept of a staggered critical period with an early 
component related to semantics development and a 
later component related to syntax development has 
been proposed by Ruben.”! Some neurophysiologi- 


- cal studies?3 support the notion that there is a frac- 


tionation of language development, with different 
neural subsystems having different sensitive periods. 
Our data could be seen to support such a scheme. 
More difficult tasks, tested by open-set tests, are as- 
sociated with a later age-at-implantation split (eg, 
8.4 years for PBK tests), compared with earlier opti- 
mum splits for more simple, closed-set speech under- 
standing tasks (4.3 years for WIPI, 5.6 years for 
GASP). Such interpretations of the present data are 
somewhat speculative. However, it is clear that the 
different binary partitioning (drop in deviance) plots 
for each speech test reflect the specific cognitive tasks 
required for each test and/or are a manifestation of a 
multistage speech development process. 


Residual Hearing. Our second focus was on resid- 
ual hearing. Do children with implants who have little 
or no residual hearing perform differently over time 
on speech understanding tasks than those with, for 
example, 20% residual hearing? Could it be that chil- 
dren with good residual hearing have more spiral gan- 
glion cell survival than those with completely dam- 
aged sensory epithelia, and thus have a better neural 
substrate for electrode activation? From a practical 
perspective, issues relating to residual hearing have 
relevance in relation to the candidacy of patients, and 
to right versus left ear choice for an implant on the 
basis of audiometric status. However, despite the im- 
portance of residual hearing, this prognostic variable 
has been relatively underresearched.24,25 


Our binary partitioning analysis indicates that there 
are no differences in the speech outcome measures 
between children with little or no hearing and those 
with as much as 20% residual hearing. This finding 
is somewhat contrary to conventional wisdom, which 
supposes that a more “intact” auditory system would 
offer a superior neural substrate for the electrically 
induced excitation patterns that a cochlear implant 
provides. However, 2 important factors about the 
present study need to be emphasized. First, the sub- 
ject population included only congenitally or prelin- 
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gually deaf children and did not include those with 
progressive loss who ultimately met the audiologi- 
cal criteria for implantation. Hence, there were no 
older children with good speech perception perfor- 
mance based on early serviceable hearing to bias the 
sample in favor of a good correlation between resid- 
ual hearing and speech outcomes. Second, we did not 
include preimplantation test scores, which would con- 
taminate the rate of postimplantation performance 
with an initial value that indicates little about per- 
formance with an implant and is often highly corre- 
lated with audiometrically measured residual hear- 


ing ability. 


Binary Partitioning Analysis. One of the aims of 
this study was to apply binary partitioning methods 
to cochlear implant outcome measures. We examined 
2 prognostic variables: age at implantation and de- 
gree of residual hearing. As to the former, our analy- 
sis revealed that speech understanding, based on the 
specific tests used, does relate to age at implanta- 
tion, but not in any simple fashion. In contrast, the 
algorithm used to analyze the effects of residual hear- 
ing showed a negative result. 
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GRAMMATICAL MORPHOLOGIC DEVELOPMENT IN PEDIATRIC 
COCHLEAR IMPLANT USERS MAY BE AFFECTED BY THE 
PERCEPTUAL PROMINENCE OF THE RELEVANT MARKERS 
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The goal of this study was to test 2 hypotheses about language development in pediatric cochlear implant (CI) users. The 
“language instinct” hypothesis states that children with CIs will develop language in the same sequence as children with normal 
hearing, but in a delayed fashion. In other words, noun plurals will develop first, and the use of the uncontractible copula and regular 
past tense will follow. An alternative hypothesis (the “perceptual prominence” hypothesis) is that the pattern of language develop- 
ment in CI users will be strongly affected by the perceptual prominence of the relevant morphological markers. This hypothesis 
predicts that the uncontractible copula will develop first, followed by noun plurals, and then by regular past tense. A sentence 
completion task was used to measure the performance of 9 pediatric CI users and compare it to that of several groups of children with 
normal hearing. The results from the CI users were consistent with the perceptual prominence hypothesis. In particular, the scores for 
the copula probe were higher than those for the noun plural probe for 8 of the 9 CI users. This result represents a rather striking 
inversion with respect to the usual development pattern in children with normal hearing and even in children with specific language 
impairment. If the perceptual prominence hypothesis receives further support in future studies, clinicians who work in language 
rehabilitation of CI users may choose to target those aspects of grammar that are less acoustically prominent to these children. In 
addition, and from a theoretical standpoint, these results suggest that although there may well be an innate language acquisition 


mechanism, patterns of language development can be strongly affected by the acoustic input. 
KEY WORDS — cochlear implant, grammatical morphology, language development, pediatrics. 


INTRODUCTION 


Studies have found that profoundly prelingually 
deaf children develop English language skills at a 
slower rate than children with normal hearing.! When 
they receive cochlear implants (CIs), their English 
language skills start developing (on average) at a nor- 
mal rate, and the mean language delay shown at the 
initial activation of the CI remains approximately con- 
stant.2 However, these studies do not tell us whether 
English language development in pediatric CI users 
follows the same sequence and pattern as in normal- 
hearing children (the “language instinct” hypothesis), 
or a different sequence, perhaps one that may be af- 
fected by the perceptual deficits suffered by CI us- 


ers (the “perceptual prominence” hypothesis). The. 


language instinct hypothesis is so named because it 
is an extrapolation of the ideas popularized by Steven 
Pinker? in the book of the same name. It postulates 
that language development is conceptually similar 
to the patterns observed in the normal development 
of gross motor skills in children, in whom certain 
landmarks almost always occur in a certain sequence: 
children first learn to hold the head upright, then the 





torso, then crawl, stand upright, and walk. An alter- 
native hypothesis, which we call the “perceptual 
prominence” hypothesis, is that CI users develop 
grammatical skills in a sequence that is determined 
by the perceptual prominence of the corresponding 
acoustic markers. In this study, we measured the per- 
formance of pediatric CI users on 3 probes designed 
to measure their proficiency in the use of 3 aspects 
of grammatical morphology: noun plurals, the uncon- 
tractible copula (“is” or “are’’), and regular past tense. 
The results were found to be fundamentally differ- 
ent from those obtained in 3 groups of participants: 
2 groups of normally developing hearing children (a 
group 2 years 5 months to 3 years 3 months of age 
and a group 3 years 6 months to 5 years 8 months of 
age) and a group of normally hearing children (3 
years 7 months to 5 years 9 months of age) with spe- 
cific language impairment, a term used for children 
who exhibit a language deficit despite normal hear- 
ing and normal nonverbal cognitive abilities. 


METHODS 
Participants. In this study, we included 9 pediat- 
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TABLE 1. DEMOGRAPHIC INFORMATION FOR 
COCHLEAR IMPLANT GROUP 


TABLE 2. DEMOGRAPHIC INFORMATION FOR 
NORMAL-HEARING GROUP 











Mean Range Mean Range 

Age at profound deafness 8 mo Birth to 2 y SLI group 

5 mo Age at probe testing 4y9mo 3 y 7 moto 

Age at implantation 3 y 8 mo 1 y 9 mo to 5 y 9 mo 

6y 11 mo PLS standard score (N = 1) 83 
Age at probe testing 6yl0mo 4y 5 moto TOLD standard score (N =8) 76 64 to 83 
. 8 y 11 mo ND-young group 

Peabody Picture Vocabulary 5y7mo 4y5moto Age at probe testing 3 SSmo to 
Test (age-equivalent score) 8 y 9 mo s “a= ne 3 3mo 

Reynell Developmental Language 4y Ilmo 4yto7y PLS standard score (N = 9) 98 89 to 112 
Scales (receptive; age- i 
equivalent score) ND-old group ; 

Reynell Developmental Language Sy7mo 4y1moto Age at probe testing 4y 10mo > y 6 mo to 
Scales (expressive; age- 7y 5y 8 mo 
equivalent score) PLS standard score (N = 1) 103 

Phonetically Balanced— 54 12 to 76 TOLD standard score (N=8) 102 91 to 116 


Kindergarten List (% correct) 





ric CI users who were participating in ongoing stud- 
ies of pediatric CI use at the Indiana University 
School of Medicine. The inclusion criteria were pre- 
lingual onset of deafness and age-equivalent scores 
on the Peabody Picture Vocabulary Test* and/or Rey- 
nell Developmental Language Scales> between 36 
and 66 months at the prior testing interval, which 
had taken place 6 months before testing for the pres- 
ent study. This range was selected because our pilot 
studies indicated that children within this range of 
language age were most likely to display levels of 
performance that were neither at ceiling (ie, 100% 
correct responses for all 3 probes) nor at floor (0%). 
At the time of testing, 6 of the 9 children had 4 to 
4!/2 years of implant use, and the other 3 had 1 to 2 
years of implant use. On average, their age at pro- 
found deafness was 0 years 8 months, and the age at 
the time of testing was 6 years 10 months. Table | 
includes their demographic information, their age 
equivalent scores for the Peabody Picture Vocabu- 
lary Test and the Reynell Development Language 
Scales, and their percent correct scores on the Pho- 
netically Balanced—Kindergarten (PBK) test® of 
speech perception at the time of testing. We com- 
pared the CI users’ performance to that of 3 groups 
in a study by Leonard et al’ of grammatical develop- 
ment in children with specific language impairment 
(SLI). All 3 groups consisted of 9 children. The first 
group exhibited signs of SLI and ranged in age from 
3 years 7 months to 5 years 9 months. The 2 groups 
matched with the SLI group on the basis of age and 
mean length of utterance ranged in age from 3 years 
6 months to 5 years 8 months and from 2 years 5 
months to 3 years 3 months, respectively. These 
groups will be referred to as ND-old (ND for “nor- 
mally developing’) and ND-young, respectively. 
Table 2 includes their demographic information and 


SLI — specific language impairment; PLS — Preschool Language 
Scale; TOLD — Test of Language Development; ND — normally 
developing. 





their standard scores for either of 2 tests showing 
their level of speech perception at the time of test- 
ing: the Preschool Language Scale—3 (PLS-3)8 or the 
Test of Language Development—Primary: 2 (TOLD- 
P:2).? The ND-young group was not tested with the 
past tense probe in the original study, but their re- 
sults with the copula and the noun plural probe still 
serve as a basis for comparison to our results. 


Tasks. To test the language instinct hypothesis ver- 
sus the perceptual prominence hypothesis in pediat- 
ric Cl users, 3 morphological probes used by Leonard 
et al’ were administered to the children: noun plu- 
rals, uncontractible copula, and regular past tense. 
The probes were sentence completion tasks that pro- 
vided obligatory contexts in order to elicit the gram- 
matical morpheme in question. For example, in the 
noun plurals probe, the children saw a picture of a 
doll and a picture of some blocks. The examiner read 
a sentence, “Here is a doll, and here are...” to elicit 
the child's production of the target word “blocks.” 
The noun plurals probe consisted of 15 trials elicit- 
ing 12 plural and 3 singular forms of regular words 
(eg, “Here is 1 marble, and here are 3...[marbles]” 
and “Here are some monkeys, and here is...[a tiger]”’) 
and 4 trials eliciting plural forms of nonsense words 
(eg, “This is a doyn. These are 2...[doyns]”).!° The 
uncontractible copula probe consisted of 18 trials (9 
eliciting the copula “is” and 9 the copula “are”), with 
16 of the 18 trials eliciting the same copula form as 
that used by the examiner in the prompt (eg, “The 
cat is little, but the... [horse is big]””). Two of the items 
used different copula forms in the examiner’s prompt 
and the child’s response to keep the child from merely 
repeating the examiner’s prompt. Twelve regular past 
items were divided evenly between final /t/ and /d/ 
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Average percentages of correct responses in noun plural, copula, and past tense probes for A) 3 groups of children with normal 
hearing, including both normally developing groups (old — 3 y 6 mo to 5 y 8 mo; young — 2 y 5 mo to 3 y 3 mo) and group 
with specific language impairment (SLI), and B) children with cochlear implants (CI). 





sounds (eg, “zipped” versus “emptied”). 


Analyses. Accurate assessment of a child’s ability 
to produce regular plural and past tense words is de- 
pendent on a child’s ability to actually produce the 
plural sounds, /s/ and /z/, and the regular past sounds, 
/t/ and /d/, in word-final position. An inability to ar- 
ticulate such sounds renders scoring of such probes 
impossible. Consequently, the speech samples of the 
CI users were analyzed. All children in this sample 
demonstrated the ability to produce final /t/, /d/, and 
/s/ sounds. Three of the children were unable to pro- 
duce final /z/, but they did mark plurals with a final 
/s/, which was accepted as a correct response, as the 
goal of the noun plural probe was to measure their 
knowledge of noun plurals rather than their knowl- 
edge of English phonotactics. 


Each of the probes was scored and video record- 
ings of the sessions were double-checked for accu- 
racy by a second coder. The children’s scores were 
recorded as the percentage of items that were cor- 
rectly produced. When children provided a non-tar- 
get word response, their response was still scored if 
the response appeared in the same target form (eg, 
another regular past verb was provided in the regu- 
lar past probe). However, if the response varied from 
the target form (eg, an irregular verb form for a regu- 
lar past form), the item was excluded from the analy- 
ses. Not more than 2 items had to be excluded for 
this reason from any probe for any participant. 


The percentages of correct responses were ana- 
lyzed with 2-way repeated-measures analysis of vari- 
ance, with the 2 factors being “probe” (indicating 
which 1 of the 3 probes the data refer to) and “group” 
(indicating whether the child was part of the CI group 
or one of the comparison groups). The Tukey test 
was used to perform post hoc pairwise comparisons. 


RESULTS 


The Figure, A, shows results for the 3 comparison 
groups, which include the young and old normally 
developing children, as well as the children with SLI. 
In all 3 groups, the percentages of correct responses 
in the noun plural probe were higher than those in 
the copula probe, although this result was obscured 
by a ceiling effect in the ND-old group. In the 2 
groups in which the past tense probe was adminis- 
tered, the percentages of correct responses for this 
probe were lower than for the noun plural or copula 
probes. The Figure, B, shows the results for the group 
of CI users. In contrast to the comparison groups, 
the percentage of correct responses was higher for 
the copula probe than for the noun plural probe. Per- 
formance on the past tense probe was worse than that 
on the copula or noun plural probes — the same re- 
sult that was observed in the comparison groups. 


The analysis of variance revealed a significant in- 
teraction (p < .01) between the probe and group fac- 
tors. In other words, results on different probes de- 
pend on whether the participant belongs to the Cl 
group or to one of the comparison groups. Pairwise 
comparisons showed that performance on the noun 
plural probe was higher than that on the copula probe 
for children in the comparison groups (p < .001), but 
there was no significant difference in the CI group. 


DISCUSSION 


To determine the relative performance on the 3 
probes that would be predicted by the perceptual 
prominence hypothesis, it is necessary to make well- 
informed hypotheses about the relative perceptual dif- 
ficulty of the relevant morphological markers. Regu- 
lar past tense in English is marked by the addition of 
a final /t/ or /d/, both of which are characterized by a 
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brief burst and formant transition lasting a few tens 
of milliseconds. In contrast, the noun plurals elic- 
ited in this study are marked by the addition of a 
final /s/ or /z/. These phonemes in word-final posi- 
tion have much longer duration than the bursts asso- 
ciated with a final /d/ or /t/— sometimes 3 or 4 times 
longer. In addition, CI users do not have major diffi- 
culties detecting high-frequency sounds such as /s/ 
or /z/. Therefore, it is reasonable to assume that the 
morphological marker for noun plurals is perceptu- 
ally more prominent to CI users than the regular past 
tense marker. Likewise, we may also assume that the 
perceptual differences between the 2 forms of the 
uncontractible copula (“is” or “are”) are more promi- 
nent than the difference between singular and plural 
noun forms, because the words “is” and “are” have 
a long list of acoustic differences in addition to the 
presence of the final /z/ in one of the words. These 
differences include the first and second formant fre- 
quencies of the initial vowel, the presence of a retro- 
flex vowel with its characteristically low third for- 
mant frequency, and the duration difference between 
the initial short vowel in “is” and the initial long vow- 
el in “are.” 


Given the preceding considerations, results ob- 
tained from the CI users are consistent with the per- 
ceptual prominence hypothesis, because the percent- 
age of correct responses was highest for the copula 
probe (which is associated with the most perceptu- 
ally prominent markers), intermediate for the noun 


plural probe, and lowest for the regular past tense 
probe (which is associated with the markers that are 
least perceptually prominent). These results are in 
direct contrast with those obtained from normal-hear- 
ing chilcren, whether young or older, and even in- 
cluding those with delayed language development, 
because all of these groups (and the vast majority of 
individuals within each group) showed greater pro- 
ficiency in the use of noun plurals than in the use of 
the copula. It is possible to speculate that for nor- 
mal-hearing children, all morphological markers are 
perceptually very prominent and that therefore the 
sequence in which different grammatical skills de- 
velop is determined by factors other than perceptual 
prominence. In other words, the language instinct hy- 
pothesis is not necessarily wrong, but it seems to be 
insufficient in the case of children whose input is 
dramatically different from that of typical children 
with normal hearing. 


If the perceptual prominence hypothesis receives 
further support in future studies, clinicians who work 
in language rehabilitation of CI users may choose to 
target those aspects of grammar that are less acous- 
tically prominent to these children. In addition, and 
from a theoretical standpoint, these results suggest 
that patterns of language development can be strongly 
affected by the acoustic input. Thus, the admittedly 
quaint term “language instinct” may not be the most 
appropriate one to name the human innate language 
acquisition mechanism. 
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TALKER DISCRIMINATION BY PRELINGUALLY DEAF CHILDREN 
WITH COCHLEAR IMPLANTS: PRELIMINARY RESULTS 


MIRANDA CLEARY DAVID B. PISONI, PHD 
BLOOMINGTON, INDIANA 


Forty-four school-age children who had used a multichannel cochlear implant (CI) for at least 4 years were tested to assess their 
ability to discriminate differences between recorded pairs of female voices uttering sentences. Children were asked to respond “same 
voice” or “different voice” on each trial. Two conditions were examined. In one condition, the linguistic content of the sentence was 
always held constant and only the talker’s voice varied from trial to trial. In another condition, the linguistic content of the utterance 
also varied so that to correctly respond “same voice,” the child needed to recognize that two different sentences were spoken by the 
same talker. Data from normal-hearing children were used to establish that these tasks were well within the capabilities of children 
without hearing impairment. For the children with CIs, in the “fixed sentence condition” the mean proportion correct was 68%, 
which, although significantly different from the 50% score expected by chance, suggests that the children with CIs found this 
discrimination task rather difficult. In the “varied sentence condition,” however, the mean proportion correct was only 57%, indicat- 
ing that the children were essentially unable to recognize an unfamiliar talker’s voice when the linguistic content of the paired 
sentences differed. Correlations with other speech and language outcome measures are also reported. 


KEY WORDS — child, cochlear implant, indexical property, speech perception, talker discrimination, voice discrimination. 


INTRODUCTION unavailable, such as in communicating via telephone, 
in listening to the radio, or if the listener has momen- 


Al body of h has sh that l- 
ee TCSCATCI NAS SIOWN: Mat norma tarily turned away from the speaker’s face. 


hearing (NH) listeners are sensitive to properties in 


the speech signal that provide information about the In the present study, we asked pediatric users of 
talker who produced the utterance. These properties cochlear implants (CIs) to make perceptual judg- 
are sometimes referred to as “indexical” properties ments about whether pairs of recorded sentences were 
of speech and can convey, although imperfectly, in- spoken by the same or different talkers. Two condi- 
formation about the talker’s gender, age, regional tions were explored. In one condition, the linguistic 
background, emotional state, etc.1-3 Indexical infor- contents of the paired utterances were identical (re- 
mation is usually conceptualized as contrasting with ferred to henceforth as the “fixed sentence condi- 
“linguistic” information about the intended pattern tion”). In the other condition, the linguistic contents 
of phonemes/phonemic contrasts.4 Because linguis- of the two sentences always differed (the “varied sen- 
tic and indexical information are both simultaneously tence condition”). In this condition, it was necessary 
encoded in the acoustic waveform, the primary ques- for the listener to be able to identify two separate 
tion of interest to speech researchers is how the par- utterances as spoken by either the same talker or by 
allel extraction of these two types of information two different talkers. Because the talkers used in this 
takes place, and the degree to which these processes study were previously unfamiliar to the listeners, we 
interact with each other.5 reasoned that in order to perform the varied sentence 
The ability to use indexical information to percep- condition it would be necessary for the listeners to 
tually discriminate between the utterances of differ- form a representation, or expectation, of what the 
ent talkers is often taken for granted in communica- speaker of the first utterance in each pair would sound 
tive situations. In order to interpret what is being said LiKe in a subsequent, linguistically different utterance. 
in the larger context of a spoken conversation, a lis- Upon hearing the second utterance, the listener would 
tener must be able to keep track of the current speaker then be able to make a judgment about whether the 
and register a change of speaker when it occurs. For tWO sentences were, in fact, spoken by the same talker 
both NH and hearing-impaired persons, this task be- or by two different talkers. 
comes more difficult when associated visual cues are The 8- and 9-year-old children who participated 
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TABLE 1. DEMOGRAPHIC CHARACTERISTICS OF 











PEDIATRIC CI GROUP 
Mean Minimum Maximum SD 

Age at testing (y) 8.76 7.92 9.91 0.53 
Age at onset of 

deafness (mo) 2.52 0 36 7 
Duration of deafness 

(y) 2.94 0.58 5:17 1.11 
Duration of CI use (y) 5.60 4.09 6.87 0.66 
No. of active : 

electrodes 18.20 8 22 2.82 


CI — cochlear implant. 





in this study had all used a multichannel CI for at 
least 4 years. Forty-one of the children used a Nu- 
cleus-22 device. Two children used a Clarion implant. 
One child was a former Nucleus-22 user who had 
switched to the Nucleus-24 implant 4 months before 
testing. All of the children at the time of data collec- 
tion were using a coding strategy (either spectral peak 
or continuous interleaved sampling) capable of con- 
veying a fairly detailed spectral representation of the 
speech signal. Given the design of the device and 
the children’s history of use, we expected that at least 
some of the pediatric CI users would be able to make 
the simple voice discriminations presented under the 
“fixed sentence” condition. Because the linguistic 
content of the sentence was held constant across all 
comparisons, intertalker differences should consti- 
tute the primary source of any perceived acoustic 
variation between sentences. We anticipated that the 
“varied sentence” condition would prove more dif- 
ficult, because a generalizable representation of each 
talker’s voice is presumably necessary to accomplish 
this task. However, if for the purpose of the task at 
hand the children were able to ignore the linguistic 
variability as directed, the signal provided by the im- 
plant should be sufficient to permit some children to 
form the necessary representations of the different 
voices. Data from a group of younger NH children 
helped us assess the overall difficulty of this “varied 
sentence” condition. 


METHODS 
PARTICIPANTS 


NH Children. Twenty-one NH children were tested 
as part of a larger project being conducted at the In- 
diana University Speech Research Laboratory. The 
children ranged in age from 5 years 3 months to 5 
years 8 months (mean, 5 years 6 months; SD, 2 
months). 


Pediatric CI Users. Forty-four hearing-impaired 
children with CIs participated as part of a larger study 
currently being conducted at Central Institute for the 
Deaf.® As shown in Table 1, the children ranged in 


age from 7.92 to 9.91 years at the time of testing 
(mean, 8.76 years). All pediatric CI users in this study 
had lost their hearing before 3 years of age; the ma- 
jority were reported as congenitally deaf. The dura- 
tion of deafness before implantation averaged ap- 
proximately 3 years, and every child had used his or 
her implant for at least 4 years before the present 
testing. The group included both children who used 
oral communication and children who used total com- 
munication. 


The NH children were not recruited as a direct com- 
parison group for the CI group. Nevertheless, we be- 
lieve that reporting the NH children’s performance 
here is useful at this time to establish that the experi- 
mental procedure used was within the perceptual and 
cognitive abilities of normally developing children 
3 to 4 years younger than the CI users in this study. 


STIMULUS MATERIALS 


In addition to similarities between talkers, key fac- 
tors in determining the difficulty of a talker discrimi- 
nation task are the amount of information provided 
per talker and the number of different talkers among 
which the listener is asked to discriminate.”8 We 
therefore used relatively long sentence-length stimuli 
and only a very small set of 3 female talkers. The 
stimuli were selected from the Indiana Multi-Talker 
Sentence Database (IMTSD),?:!9 a CD-ROM con- 
taining digital recordings of 21 talkers each uttering 
100 sentences selected from the Harvard Sentence 
lists.!!-!2 All sound files were sampled at 20 kHz with 
16-bit amplitude quantization and normalized such 
that the average root-mean-square values for all files 
were equated. 


Eight sentences were used for the practice trials, 
and another 24 sentences were selected for use dur- 
ing the test trials. The sentences were selected to have 
roughly similar construction, and all tokens were be- 
tween 1.50 and 2.25 seconds in duration (8 to 11 syl- 
lables; eg, “The lazy cow lay in the cool grass”). An 
effort was made to select sentences that contain vo- 
cabulary the children would be familiar with; how- 
ever, because of the nature of the available database, 
there remain some words that are probably unfamil- 
iar to hearing-impaired children (eg, “colt’’). 


Tokens from 2 male talkers were selected for the 
practice trials, and tokens from 3 female talkers were 
selected for the test trials. The 3 female talkers used 
for the test stimuli were talkers 6, 7, and 23 from the 
IMTSD. This set was deliberately chosen to include 
some separation between the talkers’ mean funda- 
mental frequency (F0) values. As reported by Brad- 
low et al,!° the mean FO values for each talker over 
the full set of sentences contained in the database 








_ 
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were as follows: talker 6, approximately 168 Hz; talk- 
er 7, approximately 179 Hz; and talker 23, approxi- 
mately 237 Hz. The recordings from the 3 talkers 
were otherwise similar in that all were produced by 
highly intelligible female adults with similar speak- 
ing rates, similar regional accents, and no marked 
emotional quality. 


Because a same-different discrimination task was 
to be used, 6 trials representing every possible or- 
dered pairing of the 3 voices were used for the “dif- 
ferent voice” trials. For the 6 “same voice” trials, 
each of the 3 voices was paired with itself twice. This 
arrangement was used in both the fixed sentence and 
varied sentence conditions. Within each pair, a 1-sec- 
ond silent interval was inserted between the offset 
of the first sentence and the onset of the second sen- 
tence. 


PROCEDURE 


NH Children. Each of the 21 NH children passed 
a hearing screening at 250 Hz, 500 Hz, 1 kHz, 2kHz, 
and 4 kHz at a level of 20 dB hearing level on a por- 
table pure tone audiometer (MA27; Maico, Minne- 
apolis, Minnesota) and TDH-39P headphones. A re- 
sponse at 25 dB hearing level was accepted for 250 
Hz. Left and right ears were tested separately. 


The NH children received only the varied sentence 


-condition of the talker discrimination task. After the 


child was instructed about the basic nature of the task, 
4 practice trials were administered with a personal 
computer and a tabletop loudspeaker. All children 
performed the same ordering of practice trials (same, 
different, same, different) with stimuli from 2 male 
talkers. On the first 2 practice trials, the experimenter 
modeled the task by giving the correct answer after 
the pair of sentences was played. The child was then 
encouraged to do the last 2 practice trials on his or 
her own, and feedback was provided. During the 
practice period, the experimenter explained that if 
the child was not sure about the correct answer, he 
or she could ask for the same pair of sentences to be 
presented again, up to 2 additional times. This op- 
tion was available for both the practice and test tri- 
als. The 12 test trials were presented via headphones 
(DT100; Beyerdynamic, Farmingdale, New York), 
and the examiner was unable to hear the current trial 
as it was played. The child was asked to verbally 
report whether the 2 talkers were the “same” or “dif- 
ferent.” Assignment of the 24 different sentences to 
the 12 test trial pairs, and the order of presentation 
of the test trials were pseudorandomized by the com- 
puter. The child received no explicit feedback dur- 
ing the test trials regarding the accuracy of his or her 
performance. 


Pediatric CI Users. The pediatric CI users were 


tested in a manner very similar to that of the NH 
children, except that the discrimination task involved 
an additional condition. The fixed sentence condi- 
tion was administered first, followed by the varied 
sentence condition. In the fixed sentence condition, 
the child heard only 1 sentence across all 12 trials, 
as spoken by the 3 different talkers. The assignment 
of this sentence was balanced such that each of the 
24 sentences selected for use in the varied sentence 
condition was heard in the fixed sentence condition 
by approximately 2 children. 


All children with CIs first underwent the same 4 
fixed sentence practice trials with a single sentence 
and 2 different male voices. Twelve randomized test 
trials were then administered in the fixed sentence 
condition. The child next received the revised instruc- 
tions for the varied sentence condition. These instruc- 
tions alerted the child that the sentence content of 
the trials would vary, but emphasized that the pri- 
mary task of “listening to the voice” had not changed. 
All children then underwent the same 4 varied sen- 
tence practice trials with 8 different sentences and 2 
different male voices. Finally, 12 randomized test tri- 
als with the 3 female talkers and 24 different sen- 
tences were administered. Four different pseudoran- 
dom assignments of the 24 different sentences to the 
12 available test pairs were generated before testing, 
and nearly equal numbers of children were tested with 
each randomization. The children received no explicit 
feedback during any of the test trials regarding the 
accuracy of their responses. 


The pediatric CI users were tested with a Macin- 
tosh portable laptop computer. The stimuli were pre- 
sented via a loudspeaker (AV280; Advent, Benica, 
California) at approximately 70 dB sound pressure 
level. In some cases, the level was adjusted upward 
at the request of the child. The presentation of all 
stimuli was audible to the examiner. Although the 
practice trials were repeated for a few children in 
order to get the child on task, no test pairs were re- 
peated. Although this is different from the method 
used with the NH children, the impact of this change 
is probably small, because very few of the NH pre- 
schoolers requested any repetitions of the test trials. 


The procedures followed with the CI users were 
administered by a clinician experienced in working 
with hearing-impaired children. This clinician was 
trained in the task administration by the researcher 
responsible for gathering the data from the NH chil- 
dren, 


RESULTS 


NH Children. The NH children had very little dif- 
ficulty with the varied sentence condition on which 
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they were tested, scoring 89% correct on average as 
a group. Most children scored either 12 of 12 or 11 
of 12 correct. The distribution of scores obtained from 
the NH children is shown on the bottom panel of the 
Figure. The scores of the children as a group dif- 
fered significantly from a chance performance of 50% 
(t(20) = 15.28; p < .001). The few errors that were 
observed primarily involved children’s incorrectly 
responding “same” for different voice pairs involv- 
ing comparisons between talkers 6 and 7. Very few 
other errors were obtained. 


Pediatric CI Users. The score distributions obtained 
from the CI users for both the fixed and varied condi- 
tions are shown in the top and center panels of the 
Figure. The mean accuracy for the group in the fixed 
sentence condition was 68%, which is significantly 
above a chance performance of 50% (t(43) = 7.13; p 
< .001). The mean accuracy for the group in the var- 


TABLE 2. MISS RATES AND FALSE ALARM RATES 
FOR CI GROUP 











Fixed Varied 
Sentence Sentence 
Error Type Condition Condition 
Responded “same” when different 
(miss rate) 30 34 
Responded “different” when same 
(false alarm rate) 35 92 





Mean Percent 
Correct = 57% 


Fixed Sentence 


Condition 


Cl Group 


Mean Percent 
Correct = 68% 


Varied Sentence 


Condition 


CI Grou Distribution of talker discrimination scores A) 


in fixed and varied sentence conditions for 44 
pediatric cochlear implant (CI) users and B) in 
varied sentence condition for group of 21 nor- 
mal-hearing (NH) 5-year-olds. 


Varied Sentence 


Condition 


NH Group 


Mean Percent 
Correct = 89% 


ied sentence condition was 57% correct, which, al- 
though significantly above a chance performance of 
50% (t(43) = 3.10; p = .003), indicates that the pedi- 
atric CI users encountered considerable difficulty 
with this task. A paired-samples t-test between the 
scores in the 2 conditions showed a significant de- 
crease in scores for the varied sentence task as com- 
pared to the fixed sentence task (t(43) = 3.66; p = 
.001). The scores in the 2 conditions showed a weak 
but significant positive correlation (r = .30, p = .049). 


The CI users clearly had much greater difficulty 
with the varied sentence condition of the talker dis- 
crimination task than did the NH children. Although 
we did not test the NH children on the fixed sen- 
tence condition, it is very likely that they would have 
done extremely well — probably better than the 89% 
they scored on the varied sentence condition. 


Table 2 shows the miss and false alarm error rates 
obtained in each condition for the CI users. One pat- 
tern evident in Table 2 is a bias for more often incor- 
rectly responding “different” rather than “same” for 
pairs tested in the varied sentence condition. No such 
response bias was observed in the NH children. This 
pattern of results suggests that the children in the CI 
group may have found it difficult to ignore the lin- 
guistic variability present in the varied sentence con- 
dition as instructed. 
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TABLE 3. SIMPLE BIVARIATE CORRELATIONS 
BETWEEN TALKER DISCRIMINATION PERFORMANCE 
AND DEMOGRAPHIC VARIABLES 


TABLE 4. SIMPLE BIVARIATE CORRELATIONS 
BETWEEN TALKER DISCRIMINATION PERFORMANCE 
AND WORD RECOGNITION MEASURES 























Proportion Proportion 





Correct Correct 
in Fixed in Varied 
Sentence Sentence 
Condition Condition 
Age at testing (y) -.13 .06 
Age at onset of deafness (mo) 19 16 
Duration of deafness (y) —.12 .06 
Duration of CI use (y) —.06 —.19 
No. of active electrodes .26 .19 
Degree of exposure to oral-only 
communication environment 21 32* 


Data are r values. 
*Correlation is significant at .05 level (2-tailed). 











Despite the fact that the talker discrimination scores 
obtained were not very continuously distributed be- 
cause of the small number of trials, the variability 
present in the obtained scores allowed us to calcu- 
late correlations between talker discrimination scores 
and other measures available for these children. Be- 
cause the pediatric CI users were nearly at chance in 
the varied sentence condition, meaningful correla- 
tions obtained with this measure are unlikely, and 
although shown in Tables 3 and 4,!3-!5 must be inter- 
preted cautiously. 


The results shown in Table 3 indicate that within 
this sample of CI users, talker discrimination perfor- 
mance was not significantly correlated in either con- 
dition with age at testing, age at onset of deafness, 
duration of deafness, or duration of implant use. Re- 
call, however, that the children were preselected to 
demonstrate relatively little variability along these 
dimensions. Weak evidence was found that a greater 
number of active electrodes and more exposure to 
oral-only communication were positively associated 
with better talker discrimination scores. Exposure to 
oral-only communication was quantified by the com- 
munication mode scoring procedure described by 
Geers et al, which takes into account the type of 
communication environment experienced by the child 
in the year just before implantation, in each year over 
the first 3 years of CI use, and in the year just before 
the current testing. An independent-samples t-test 
using a median split along the variable of communi- 
cation mode score indicated that the “primarily oral” 
group’s mean of 62% correct on the varied sentence 
condition was, in fact, significantly higher than the 
“primarily total communication” group’s mean of 
52% correct (42) = 2.41; p = .021). 


As shown in Table 4, the talker discrimination 
scores were positively correlated with three outcome 
measures of spoken word recognition gathered by 


Proportion Proportion 


Correct Correct 
in Fixed in Varied 
Sentence Sentence 

Condition Condition 





Word Intelligibility by Picture 

Identification (WIPI) Closed-Set 

Word Identification}!5 .60* 36t 
Bamford-Kowal-Bench Open-Set 

Sentence Test, Key Word 

Identification (BKB)!4 44* 16 
Lexical Neighborhood Test Easy 

Word Lists Open-Set Word 

Identification (LNTe)!5 A8* 32t 

Data are r values. 
*Correlation is significant at .01 level (2-tailed). 
}Correlation is significant at .05 level (2-tailed). 











clinicians at Central Institute for the Deaf for another 
project within a few days of the talker discrimina- 
tion data. These correlations were moderately large 
and statistically significant in the case of the fixed 
sentence condition, suggesting that children who are 
better able to identify spoken words are also better 
equipped to perceive acoustic information useful for 
discriminating between talkers. 


DISCUSSION 


The talker discrimination results reported in this 
paper are preliminary, and we are currently in the pro- 
cess of expanding the scope of this research project 
in a number of directions. Our results do, however, 
confirm the expectation that prelingually deafened 
children who have acquired language via a multi- 
channel CI have more difficulty discriminating be- 
tween similar-sounding talkers than do NH children, 
particularly under conditions in which the linguistic 
content of the message is varied. 


In general, the existing literature indicates that for 
children with CIs, the large acoustic differences in 
Fo that distinguish declarative versus WH-question 
intonation and male versus female speech are fairly 
easily discriminated, even before phonemic distinc- 
tions are readily made.!® Because our stimulus set 
required a more difficult discrimination than was typi- 
cally used in the few studies that have asked chil- 
dren with CIs to make judgments about the indexi- 
cal properties of speech, we did expect that the hear- 
ing-impaired children would find our talker discrimi- 
nation tasks more difficult. Even so, we were sur- 
prised at just how difficult many of the children with 
CIs found the varied sentence condition, particularly 
given that this particular discrimination was well 
within the capacities of normally developing chil- 
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dren. We suggest that although the device does con- 
vey the relevant spectral detail, children with CIs 
have difficulty interpreting the information that NH 
listeners routinely use to recognize subtle indexical 
differences between talkers. Given that the ability to 
explicitly discriminate between talkers develops over 
time, even in NH children, with the ability to recog- 
nize briefly studied unfamiliar voices continuing to 
improve throughout the school-age years and reach- 
ing adult levels by adolescence,!’ we think it is prob- 





able that children with CIs have the ability to im- 
prove their skills in this area. It is not known, how- 
ever, whether this improvement can be brought about 
simply through greater everyday experience with the 
implant. or whether explicit training is necessary. As 
prelingually deafened children with CIs begin to en- 
ter mainstream classrooms in larger numbers, it will 
become increasingly important to understand how 
these children encode and process the enormous in- 
dexical variability present in spoken language. 
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COGNITIVE PROCESSING IN CHILDREN USING COCHLEAR 
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We performed this study to determine whether children using a cochlear implant performed differently from age- and gender- 
matched hearing aid users on 8 neuropsychological measures of visual memory, attention, and executive functioning. The study also 
examined whether differences in cognitive skills could account for some of the observed variance in speech perception, vocabulary, 
and language abilities of hearing-impaired children. In contrast to previous studies, our results revealed no significant cognitive 
differences between children who use a cochlear implant and children who use hearing aids. Partial correlation analysis indicated 
that the children’s visual memory skills, ie, their recognition memory, delayed recall, and paired associative learning memory skills, 
correlated significantly with their language skills. When examined at a significance level of .01, attention and executive functioning 
skills did not relate to the children’s developing speech perception, vocabulary, or language skills. The results suggested that differ- 
ences in visual memory skills may account for some of the variance seen in the language abilities of children using implants and 


children using hearing aids. 


KEY WORDS — attention, child, cochlear implant, cognition, hearing aid, language, memory, speech perception, vocabulary. 


INTRODUCTION 


Although many children who use a cochlear im- 
plant develop substantial improvements in their 
speech perception, word recognition, and language 
skills,!- there is still great variation in the perfor- 
mance levels achieved. Factors such as implant ex- 
perience and school setting are known to contribute 
to this variation; however, much of the variation re- 
mains unaccounted for.?5 Recently, it has been pro- 
posed that the integrity of cognitive functions such 
as attention, memory, and executive functions may 
influence language acquisition after implantation.® 


To date, most studies of nonlanguage functions in 
children using cochlear implants have investigated 
attentional functions. This focus relates to observed 
behavioral similarities between profoundly hearing- 
impaired children and children with disorders char- 
acterized by impulsiveness (eg, attention deficit hy- 
peractivity disorder) and hyperactivity.”-!° Attention- 
al functions are necessary for joint attention and turn- 
taking skills, which are two basic stages of language 
development. Attentional functions are also related 
to profoundly hearing-impaired children’s reading 





skills, conceptual thinking, and classroom perfor- 
mance.!! The data to date suggest that children with 
a cochlear implant have superior attentional abilities 
as compared to profoundly hearing-impaired chil- 
dren who use either hearing aids or vibrotactile aids, 
although this difference may be moderated by matu- 
ration. 1213 For example, between the ages of 6 and 8 
years, profoundly hearing-impaired children with and 
without cochlear implants performed the same on con- 
tinuous performance tasks (CPTs), and both groups 
performed significantly worse than age-matched hear- 
ing controls.!2 However, the performance of another 
group of children with implants between 9 and 13 
years of age was similar to that of normally hearing 
controls and was superior to that of profoundly hear- 
ing-impaired children without implants. !2 Similarly, 
although CPT performance improves with increas- 
ing age in all hearing-impaired children, the magni- 
tude of the developmental trajectory in attentional 
performance was greatest in children 7 to 9 years of 
age who had implants.!3 These studies included chil- 
dren who participated in either an oral or a total com- 
munication education program. Profoundly hearing- 
impaired children may develop a distribution of atten- 
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tional processes different from that of normally hear- 
ing children, so that the visual system takes on dual 
roles: that of the ears, in attending to the surround- 
ing environment for changes, as well as the usual one 
of the eyes, in focusing attention on a specific task.!3 
Children provided with additional auditory informa- 
tion via an implant may, in conjunction with devel- 
oping attention skills, develop a more evenly distrib- 
uted attention system that allows them to perform 
similarly to normally hearing children on visually 
assessed attention tasks.!3 Observed differences be- 
tween young versus older children with implants 
could be attributed to developing attentional pro- 
cesses that develop most rapidly between the ages 
of 5 and 7 years.!* 


Although these findings suggest that attention 
skills may differ in hearing-impaired children in re- 
lation to their access to auditory information, recent 
claims regarding the absence of a sound neuroscien- 
tific framework renders CPTs a poor model of atten- 
tional function.!> Furthermore, as Mitchell and Quitt- 
ner! noted, performance impairments on CPTs could 
arise from dysfunction of cognitive abilities other than 
attention. In addition to attending to the stimuli, CPTs 
also require that children remember which stimuli 
are targets and which are not, inhibit responses to 
nontargets, and select appropriate movement se- 
quences at the appearance of the target. Impairments 
in any of these functions could give rise to poor per- 
formance on CPTs even if attentional processes were 
normal. Neuropsychological models consider move- 
ment sequencing, the inhibition of responses, and 
working memory as executive functions, all of which 
could also give rise to poor performance on CPTs 
even if attentional processes were normal.!? There- 
fore, although these studies suggest that attention 
skills may differ in children using a cochlear implant 
versus profoundly hearing-impaired children, a broad- 
er investigation of different cognitive processes is 
warranted. This investigation should use neuropsy- 
chological tests that are based on a sound neurosci- 
entific framework, that have a demonstrated sensi- 
tivity to the effects of maturation, and that can be 
used to study cognition in children with limited lin- 
guistic skills because of their young age, profound 
hearing loss, or both. The current study compared a 
range of visual attentional, executive, and memory 
abilities between implant users and hearing aid us- 
ers with a battery of nonverbal computer-adminis- 
tered neuropsychological tasks, relating their perfor- 
mance to their developing speech perception, vocabu- 
lary, and language skills. 


METHOD 
Subjects. The participants were drawn from an on- 


going longitudinal language study and are described 
in detail elsewhere.! The sample consisted of 24 Nu- 
cleus cochlear implant users and 24 hearing aid us- 
ers selected so that the two device groups would be 
matched overall for age and gender. Their ages ranged 
between 6.0 and 14.5 years, with an average of 9.39 
years (SD, 2.26 years) for the entire sample. All par- 
ticipants attended an oral-aural educational setting, 
and all had prelingual hearing loss. No subjects were 
color-blind. To obtain suitable age and gender matches 
for the cochlear implant group as a whole, we in- 
cluded hearing aid users with various degrees of hear- 
ing loss; this group consisted of 11 children with a 
profound loss, 8 children with a severe loss, and 5 
children with a moderate to severe loss. 


Measures of Cognitive Abilities. The participants’ 
developing attention, executive functions, and visual 
memory skills were assessed with the Cambridge 
Neuropsychological Test Automated Battery (CAN- 
TAB),!* a computed nonverbal battery of visually 
presented neuropsychological tests. The tests were 
administered on a touch-sensitive computer screen 
and were completed by the children within 4 months 
of their annual language assessment. The CANTAB 
battery consisted of 9 subtests. The Motor Reaction 
Time task was administered initially to all subjects, 
purely to allow them to gain familiarity with using a 
touch screen. The remaining 8 tests were divided un- 
der 2 group headings: 4 visual memory subtests and 
4 tests of attention and executive functioning. The 
presentation order of these 2 subtest groupings was 
balanced across subjects, as was the ordering of the 
tests within the groupings. The majority of subjects 
completed the 8 subtests in 2 hour-long sessions con- 
ducted either at school or at home in a quiet room. 
Each subtest was demonstrated up to a set point with 
simple verbal instructions accompanied by some 
basic gestures, then restarted for the subject. The sub- 
jects were provided with feedback from the computer 
program and were given verbal encouragement by 
the experimenter as needed. All subjects were re- 
warded with stickers for their involvement. Each sub- 
test will be described briefly. The reader is referred 
to the reports of Morris et al,!? Roberts and Saha- 
kian,? and Luciana and Nelson?! for detailed expla- 
nations of each subtest. 


Visual Memory Subtests. The Pattern and Spatial 
Recognition test assessed recognition memory skills 
for recently presented, nonverbal patterned and spa- 
tial stimuli from 2 items: | matching the target, the 
other a distractor. Percentage correct scores were cal- 
culated out of 24 items for pattern recognition and 
out of 20 for spatial recognition. The Delayed Match- 
ing to Sample (DMTS) test examined delayed match- 
ing of complex, nonverbal patterns from among 4 
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response choices. One response item matched the tar- 
get, 2 matched either the colors or patterns of the 
target stimulus, and the remaining option acted as a 
distractor. The task included 3 delay conditions — 
0-, 4-, and 12-second delays — as well as a simulta- 
neous option, during which the target stimuli re- 
mained on the screen while the 4 response options 
were shown. Each condition was assessed 10 times. 
From this, total percentage correct scores were calcu- 
lated for each condition. The Paired Associates (PA) 
test assessed the subject’s ability to quickly learn as- 
sociations between various patterns and their presen- 
tation location on the touch screen. The task increased 
in complexity from 1-pattern location to recall, to 2-, 
then 3-, 6-, and 8-pattern locations to retain and re- 
call. The program recorded the number of trials re- 
quired by the subject to place each pattern in its cor- 
rect location, in addition to the overall number of 
completed levels and errors made. 


Attention and Executive Functions Subtests. The 
Intradimensional/Extradimensional Set-Shifting Task 
(ID/ED) measured discrimination and reversal learn- 
ing while the subject shifted his or her attention to 
changing visual stimuli. This task contained 9 stages 
of increasing difficulty. At each stage, the child was 
presented with 2 stimuli. By touching 1 stimulus, the 
subject received feedback from the computer as to 
whether they had selected the “correct” response. 
Once the child had identified the correct item, they 
were encouraged to continue selecting that item un- 
til, at an unspecified time, the rule changed. The task 
complexity increased, requiring the subject to apply 
a previously learned rule to new stimuli and finally 
to a previously ignored aspect of the stimulus. The 
program recorded the number of responses made by 
the subject to ascertain and complete each of the task 
levels and also the number of errors. The Spatial Span 
test measured memory for a figural sequence. White 
boxes presented randomly on the touch screen 
changed color in a particular sequence, which the 
subject was to repeat after a brief delay. This task 
increased in complexity, beginning with a 2-box se- 
quence and continuing until the subject was unable 
to repeat the sequence correctly. The highest num- 
ber of boxes correctly touched in sequence was re- 
corded. The Spatial Working Memory (SWM) test 
measured working memory for spatial stimuli and 
required the use of mnemonic information to search 
effectively. The subject searched through boxes to 
locate hidden tokens. Each box contained only 1 to- 
ken at some period during the trial. Throughout the 
test, the number of boxes to search through increased 
from 2 to 3, 4, 6, and finally 8 boxes; each task level 
had 4 trials. The subject’s efficiency in searching for 
and recalling the location of previously obtained to- 


kens was recorded, providing specific information 
on his or her strategy skills. The Tower of London 
test measures spatial planning and behavioral inhi- 
bition. The subjects were presented with 2 images of 
3 colored balls arranged in “pockets.” The “bottom” 
ball arrangement was slightly different from the “top” 
image arrangement. Using as few ball moves as pos- 
sible, the subject was to match the bottom image to 
the top image. The task complexity increased from 
2-move trials to 3-, 4-, and 5-move trials. The aver- 


-age number of moves required by the subject to com- 


plete each task level was recorded. 


Language, Vocabulary, and Speech Perception As- 
sessments. The data were obtained from the subject’s 
annual language assessment performed for the on- 
going language study. The reader is referred to Bla- 
mey et al! for detailed test descriptions and adminis- 
tration procedures. In brief, the children’s open-set 
speech perception skills were assessed with the Bam- 
ford-Kowal-Bench (BKB) sentences list22 and the 
Consonant-Nucleus-Consonant (CNC) word lists,?3 
which were administered twice in the auditory-alone 
condition. The calculated average number of key 
words repeated correctly from the 2 BKB assess- 
ments and the average number of words and average 
number of phonemes repeated from the CNC testing 
were included in the current analysis. The Northwest- 
ern University Children’s Perception of Speech (NU- 
CHIPS) closed-set consonant perception test was 
administered in the auditory-alone condition, and the 
percentage of items correctly identified was noted. 
Vocabulary was measured with the Peabody Picture 
Vocabulary Test-Revised (PPVT-R),2° and the raw 
score was calculated. Depending on the subject’s lan- 
guage age, they completed either the Clinical Evalu- 
ation of Language Fundamentals (CELF)—Preschool 
test, which contained 6 subtests,2° or the CELF-3 test, 
which included 9 subtests.27 In the current sample, 
16 subjects with implants (mean age [SD], 8.76 + 
2.13 years) and 11 hearing aid users (8.28 + 1.87 
years) completed the CELF-Preschool test, and 8 
children with implants (10.12 + 1.8 years) and 13 
hearing aid users (10.98 + 1.94 years) completed the 
CELF-3 test. The CELF-3 subtests were selected on 
the basis of the subject’s chronological age. 


RESULTS 


The following variables required transformation 
by methods suggested by Tabachnick and Fidell?8; 
CANTAB variables, DMTS 12-second delay condi- 
tion, [D/ED Simple Discrimination and Compound 
Discrimination Reversal, Tower of London 2- and 
3-move trials, SWM Total Number of Double Er- 
rors—Language Variables, NU-CHIPS, and the CELF 
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TABLE 1. AVERAGE CANTAB SUBTEST SCORES BY GROUP 








Subjects With Cochlear Implants 


Subjects With Hearing Aids 








Mean SD N Mean SD N 

Pattern Recognition (total % correct) 80.79 9.54 23 78.12 13.36 24 
Spatial Recognition (total % correct) 65.65 13.84 23 69.37 12.54 24 
Delayed Matching to Sample 

Simultaneous % correct 90.00 13.19 24 91.67 10.07 24 

Average % correct on 0-s, 4-s, and 12-s trials 60.97 10.47 24 63.19 18.66 24 
Paired Associates 

Total No. of levels completed 8.00 0.00 24 8.00 0.00 24 

Total No. of levels completed on first trial 18.87 3.31 24 19.29 4.33 24 

Total No. of errors across all trials 11.29 8.54 24 11.79 10.20 24 
ID/ED 

No. of levels completed 8.00 1.84 24 8.33 1.37 24 

No. of errors across all trials 17.22 9.73 18 16.95 6.87 19 
Spatial Working Memory 

Total between-search errors 49.58 8.76 24 42.83 L377 24 

Total within-search errors 1.29 1.60 24 1.04 1.43 24 

Total double-search errors 2.83 5:71 24 1.54 2.65 24 

Strategy Score 39.33 2.41 24 38.25 3.11 24 
Spatial Span 5.08 1.53 24 5.00 1.21 24 
Tower of London (average No. of moves) 

2-move trial 2.08 0.28 24 2.00 0.00 24 

3-move trial 3.85 0.83 24 4.06 1.40 24 

4-move trial 6.35 0.96 24 6.06 1.15 24 

5-move trial 7.98 1.39 22 8.34 1.84 23 

Block 1 3.96 0.68 24 3.95 0.80 24 

Block 2 7.59 0.96 23 7.56 1.10 23 


CANTAB — Cambridge Neuropsychological Test Automated Battery, ID/ED — Intradimensional/Extradimensional Set-Shifting Task. 





subtests Word Classes, Sentence Structure, Word 
Structure, and Basic Concepts. 


Before the subjects’ speech perception, vocabu- 
lary, and language scores were compared to their per- 
formance on the cognitive tests, independent sample 
t-tests were calculated to determine whether the two 
device groups differed on their cognitive perfor- 
mance. The results revealed equivalent performances 
by both groups on all measures of visual memory, 
attention, and executive functioning as assessed by 
the CANTAB. Table | presents the average scores 
achieved by each device group on the tests of cogni- 
tion with untransformed values. Because previous re- 
search has identified cognitive differences based on 
the degree of hearing loss, the independent-samples 
t-tests were again calculated for all of the cognitive 
variables, including only matched profoundly hear- 
ing-impaired hearing aid users (N = 9) and subjects 
with implants (N = 9). The individual CANTAB re- 
sults for the 18 children are reported in Table 2. De- 
spite the small sample size, the results again revealed 
no significant differences between the two device 
groups. Because the two main device groups did not 
differ on any cognitive measure, the group results 


will be collapsed for the remaining analyses. 


Bivariate correlational analysis revealed numerous 
weak (.3) to strong (.7) relationships between the cog- 
nitive subtests and the various speech perception, vo- 
cabulary and language measures at significance lev- 
els from .05 to .001. Weak significant relationships 
at the level of .05 were identified between the NU- 
CHIPS, BKB, and CNC speech perception measures 
and some trials from various cognitive subtests; how- 
ever, the speech perception tests did not correlate with 
any of the major cognitive measures and therefore 
will not be examined further. 


Because cognitive performance increases with 
age,?! partial correlations were calculated between 
the identified significant bivariate correlations, with 
age at testing held as a constant. This analysis re- 
moved many significant correlations. Table 3 pre- 
sents the significant partial correlations between the 
major cognitive measures and the remaining vocabu- 
lary and language tests. Significant bivariate correla- 
tion values that did not alter when age at testing was 
held constant are also included in Table 3. Because 
of the small sample size, correlations significant at 
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TABLE 2. AVERAGE CANTAB RESULTS FOR MATCHED SUBGROUPS OF CHILDREN USING COCHLEAR IMPLANT 
OR HEARING AIDS 

















Cll CI2 CI3 CI4 CIS CI6 Cl7 CI8 CI9 HAl HA2 HA3 HA4 HAS HA6 HA7 HA8 HA9 


Pattern Recognition 


{total % correct) 100 75 87.5 83.3 79.2 75 
Spatial Recognition 

(total % correct) 70 70 75 75 55 65 
Delayed Matching to Sample 

Simultaneous % correct 100 100 90 100 100 90 

Average % correct on i 

0-s, 4-s, and 12-s trials 76.7 83.3 56.7 56.7 60 70 
Paired Associates 

Total No. of levels : 

completed 8 8 8 8 8 8 


Total No. of levels 
completed on first trial 6 6 6 5 3 6 


Total No. of errors across 
all trials 7 5 


ID/ED 
No. of levels completed 9 9 9 9 4 9 
No. of errors across all 


trials 6 25 10 H 28 8 
Spatial Working Memory 

Total between-search 

errors 51 45 53 56 60 48 

Total within-search errors 0 0 1 0 5 90 

Total double-search errors 0 © 1 1 28 2 

Strategy Score 40 37 37 38 41 36 
Spatial Span 8 5 4 5 3 5 
Tower of London (average 

No. of moves) 

2-move trial 2 2 2 2 2 2 

3-move trial - 3.5 3 5 3.5 3.5 4 

4-move trial 5 7.75 7.75 5 6.5 5.75 

5-move trial 8.5 7.25 7.75 F 8725 

Block 1. 3.83 4.5 4.5 3.5 3.17 3.67 

Block 2 7 7.17 8.17 F 8.33 7 


F — failed to complete trials required to calculate score. 


the .05 level are reported in the Table; however, be- 
cause of the large number of correlations, only those 
with a significance value of .01 or less will be inter- 
preted.?8 All correlations with vocabulary were sig- 
nificant only at the level of .05. When age at testing 
was held constant, no significant correlations re- 
mained between CELF and measures of attention and 
executive functioning. Significant partial correlations 
were found between the visual memory subtests and 
6 of the CELF-3 subtests and 1 CELF-Preschool lan- 
guage test at the significance level of .01..Visual mem- 
ory measures of recognition memory (Pattern Recog- 
nition), retention and delayed recall of complex in- 
formation (DMTS), and paired associative learning 
(PA) related to the children’s ability to respond to 
several of the CELF subtests. Calculated coefficients 
of determination values revealed that between 26% 
and 59% of the variance in the CELF language mea- 


100 


60 


80 


70.8 83.3 54.2 75 75 87.5 75 79.2 58.3 95.8 100 


80 60 55 50 50 70 75 90 50 90 80 


70 70 90 90 90 100 100 100 


66.7 83.3 66.7 76.7 


8 8 8 8 8 8 8 8 8 8 8 
7 6 5 5 5 7 5 6 5 6 7 
2 9 2 19 9 7 206 8 5 4 3 
9 9 7 9 9 9 9 9 9 5 FY 
18 17 36 19 25 29 32 10 12 #31 9 
58 42 56 45 57 49 32 40 38 49 42 
5 0 2 2 2 0 0 0 2 0 0 
5 0 2 2 4 3 0 5 0 1 Ọ 
40 37 39 38 40 39 35 31 35 41 38 
8 4 3 5 5 5 4 5 5 8 6 
22 Be DZ Y2. 2 8D 2s Ds 2 2 
4 45 3 745 5 3 345 3 3 
5.5 7 6.5 6.5 7.25 7.25 4.5 6.5 5.75 4.5 5.5 
5 9.75 8.75 8.75 12 75675 6 85 8 85 
3.33 4.67 3.83 5 5.17 4.5 3.33 3 3.83 3 3.83 
5.67 8.67 88.17 94 8583 7783 7717 





sures and CANTAB scores was shared. 


DISCUSSION 


This study differs from previous research in that 
only orally educated children were included in the 
sample, hearing aid users with various degrees of 
loss participated, and a wide range of neuropsycho- 
logical tests were included, each based on sound neu- 
roscientific theories. Comparison of children who 
used either a cochlear implant or hearing aids on vari- 
ous measures of attention, visual memory, and exec- 
utive functioning revealed no significant differences, 
even when only profoundly hearing-impaired sub- 
jects were compared to matched implant users. The 
two device groups performed similarly on all tests 
of cognition — a finding suggesting that the cogni- 
tive skills of children with implants were not aided 
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TABLE 3. PARTIAL CORRELATION VALUES BETWEEN CANTAB SUBTESTS AND MEASURES OF VOCABULARY 
AND LANGUAGE SKILLS 











PPVT- C-3 C-3 C-3 C-3 C-3 C-3 C-3 C-3 C-3 Pre Pre Pre Pre 
R WC FS RS SA SR SSe ~WS* C&D WA LC RSC BCe FL 
Pattern Recognition (% correct) .609 65>4 |574 —53¢ —.49° .67b-d4 
(17) (16) (18) (14) (15) (12) 
Spatial Recognition (% correct) —43° 49c 50° —.55b¢ 46° 
(20) (17) (15) (15) (25) 
Delayed Matching to Sample 
Simultaneous (%) 133° Ale 
(45) (25) 
0-s delay (%) 5554 
(22) 
4-s delay (%) —,52° 
(15) 
12-s delay (%)* -.574 47° —.48° 42° 
(21) (18) (16) (21) 
Total % correct -.554 50° —.52¢ .71¢ 
(21) (18) (16) (21) 
Paired Associates 
6-pattern trial | .29¢ —.51bd 39c 
(45) (26) (25) 
6-pattern trial 3 574 —41¢ 
(25) (25) 
8-pattern trial 2 49° 778 
(18) (10) 
8-pattern trial 3 54d —_56b4 — 54d. 
(21) (19) (19) 
Total No. first trial all correct —.4]b« 
(22) 
ID/ED Compound Discrimination- 
Revised (f)# —.40° 
(24) 
Spatial Working Memory 
Total within-search errors —.42° 
(24) 
Tower of London 
3-move trial (f)* —.42¢ 
(24) 
4-move trial (ż) —.32° 61° 
(45) (10) 


Data are correlation values, with number of participants in correlation in parentheses. 

PPVT-R — vocabulary, C-3 — CELF-3, WC — Word Classes, FS — Formulated Sentences, RS — Recalling Sentences, SA — Sentence 
Assembly, SR — Semantic Relationships, SS — Sentence Structure, WS — Word Structure, C&D — Concepts and Directions, WA — Word 
Associations, pre — CELF-Preschool, LC — Linguistic Concepts, RSC — Recalling Sentences in Context, BC — Basic Concepts, FL — 


Formulating Labels. 
®Transformed variable. 


>Denotes bivariate correlation value, as partial correlation value did not differ. 


‘p< .05. 
dp < 01. 
ep < .001. 





by the additional auditory information provided by 
the implant. In the current sample of orally educated 
children, no differences were identified between the 
two device groups in their performance on any at- 
tention measures, including their abilities to discrimi- 
nate targets from nontargets (ID/ED) and to attend 
to detail (ID/ED and DMTS simultaneous condition). 


Partial correlation analysis indicated that when age 


at testing was held constant, attention and executive 
functioning did not relate to the children’s speech 
perception, vocabulary, or language skills. Even af- 
ter age at testing was held constant, visual memory 
skills were found to correlate significantly with the 
CELF. The creators of the CELF-3 recognize that 
memory skills are important for the subtests Con- 
cepts and Directions, Recalling Sentences, and Word 
Associations.?” The current study demonstrates that 
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6 of the 9 CELF-3 subtests require memory skills to 
various extents. The ability to recognize previously 
presented patterned information, the retention of com- 
plex information for various durations, and the abil- 
ity to make associations between patterns and loca- 
tions correlated at a level between .51 and .77 at the 
significance level of .01 with the identified CELF-3 
subtests and the CELF-Preschool measure. 


Although it is clear that certain cognitive skills 
are present at the time that language is first acquired, 
the relationship and interplay between developing 
cognitive and language abilities is unknown. It may 
be that children with good visual memory skills are 
able to remember, attend to, and process language 
information better than children with poor cognitive 
skills. Alternatively, despite the nonverbal design of 
the cognitive subtests, performance on some tasks 


may be aided by the use of verbal mnemonic strate- 
gies such as labeling abstract stimuli or acknowledg- 
ing aloud a stimulus location. Children with better 
language skills may apply verbal strategies that ef- 
fectively assist them to obtain higher scores on the 
CANTAB. Either way, confirmation of these find- 
ings with a larger sample could identify future direc- 
tions for habilitation and educational programs. 


The relationships identified in this study between 
language and cognition are important considerations 
in interpreting tests that assess the speech percep- 
tion, vocabulary, and language abilities of children 
with implants and other hearing-impaired children. 
In particular, the relationships between visual mem- 
ory skills and language should be examined further 
as a possible source of the unexplained variation in 
overall language performance. 
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This study was performed to investigate factors contributing to auditory, speech, language, and reading outcomes after 4 to 6 
years of multichannel cochlear implant use in children with prelingual deafness. The analysis controlled for the effects of child, 
family, and implant characteristics so that the educational factors most conducive to maximum implant benefit could be identified. 
We tested 136 children from across the United States and Canada. All were 8 or 9 years of age, had an onset of deafness before 3 
years of age, underwent implantation by 5 years of age, and resided in a monolingual English-speaking home environment. Charac- 
teristics of the child and the family (primarily nonverbal IQ) accounted for approximately 20% of the variance in outcome after 
implantation. An additional 24% was accounted for by implant characteristics and 12% by educational variables, particularly com- 
munication mode. Oral education appears to be an important educational choice for children who have undergone cochlear implan- 


tation before 5 years of age. 


KEY WORDS — cochlear implant, communication mode, deaf education, deafness. 


The advent of cochlear implantation has had a dra- 
matic effect on the achievements of young profound- 
ly deaf children. Children who undergo implanta- 
tion early in life, followed by a period of appropriate 
rehabilitation, can achieve speech and language skills 
that exceed levels observed in profoundly deaf chil- 
dren with hearing aids. However, there continues to 
be considerable variability in the performance out- 
comes of individual children.! The impact of educa- 
tional choices on performance after implantation has 
yet to be conclusively determined. 


In 1996, the Center for Applied Research in Child- 
hood Deafness at Central Institute for the Deaf be- 
gan a study, funded by the National Institutes of 
Health, entitled “Cochlear Implants and Education 
of the Deaf Child.” This study documents the effects 
of various educational and rehabilitation models on 
deaf children’s ability to understand, produce, and 
read English while using a Nucleus 22-channel co- 
chlear implant. This study was designed to reduce 
subject variability as much as possible through sam- 
ple selection criteria and to include a sufficiently large 
number of children to control for extraneous vari- 
ables that might affect the results. The goal is to pro- 
vide information that allows parents and educators 
to make informed educational choices designed to 
maximize a child’s hearing, speech, and language de- 
velopment after implantation.2 ` 





METHOD 


Participants. According to selection criteria, the 
children in the study were 8 or 9 years old, had be- 
come deaf before 3 years of age, had undergone im- 
plantation by 5 years of age, and had 4 to 6 years of 
experience with a Nucleus 22-channel cochlear im- 
plant. Children who met these criteria and were re- 
ported to exhibit normal intelligence and reside in 
an English-speaking home environment were re- 
cruited through cooperating implant centers across 
the United States and Canada. All children were 
tested under similar conditions by a consistent group 
of examiners on an identical battery of tests. The to- 
tal sample to be included in the study consisted of 
180 children. This report includes 136 children for 
whom data analysis is complete. Characteristics of 
these participants are summarized in Table 1. 


Procedure. A multivariate design was used to ex- 
amine the effects of independent variables (commu- 
nication method, classroom type, amount of therapy) 
on outcome measures (speech perception, speech pro- 
duction, language, and reading) after intervening 
variables associated with the child, the family, and 
the implant device were controlled for (see Figure). 
The intervening variables and outcome measures 
were all obtained at one point in time at data collec- 
tion camps held in St Louis when the children were 
8 or 9 years old. Independent variables were assessed 
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TABLE 1. SAMPLE CHARACTERISTICS FOR 136 





SUBJECTS 
Mean SD Minimum Maximum 
Age at testing 9y 6 mo 8y 9y11 mo 
Age at onset of 
deafness 4mo - 9mo 0 3y 
Age at 


implantation 3 y 6mo 9mo tlyl0mo Sy2mo 
PerformanceIQ 102 15 65 136 
Duration of , 

implant use 
Duration of 


5 y 6 mo 9 mo 3y9mo 7y6mo 


SPEAK use 3y 1 y 8 mo 0 5y2mo 
No. of active 
electrodes 18 2.86 6 22 





retrospectively and averaged across the years elapsed 
since the child underwent implantation. It was hy- 
pothesized that therapy and 2ducation factors after 
implantation might explain differences in perfor- 
mance on outcome measures once the impact of in- 
tervening variables associatec with child, family, and 
implant characteristics had b2en accounted for. 


Because each of the outccme variables is multi- 
faceted and not readily quartified by a single test, 
results of batteries of measures were combined for 
each outcome by principal components analysis. Per- 
formance on each outcome was reduced to a single 
factor score by creating a weighted combination of 
all tests in each outcome battery. The language out- 
come was further subdivided into 2 factors: one com- 
prising those measures that used only spoken lan- 
guage and another associated with language measures 
that used both speech and sign together. Next, step- 
wise multiple linear regression was used to examine 
the variance in each outcome accounted for: first by 
child and family characteristics, then by implant char- 
acteristics, and finally by the independent variables. 
This approach was designed to address which edu- 
cational choices help to achieve maximum outcome 
performance after implantation when the character- 
istics of the child, the family, and the implant itself 
are held constant. 

RESULTS 


Table 2 depicts the percen-age of variance in each 


INDEPENDENT VARIABLES —> —> —> —> > -— OUTCOME VARIABLES 


Communication Mode 


Speech Perception 






Classroom INTERVENING = Speech Production 


VARIABLES 


Therapy 


Language 





Child 


Reading 


Family 


implant 





Depiction of multivariate analysis in which effects of in- 
dependent variables on outcome measures are examined 
after intervening variable effects have been removed. 


outcome accounted for by characteristics of the child 
(age, age at onset, age at implantation, IQ) and the 
family (family size, parents’ education). Child and 
family characteristics accounted for 18% of explained 
variance in speech perception factor scores, 15% in 
speech production, 11% in spoken language, 17% in 
spoken and signed language, and 20% in reading. 
The variables that contributed significant indepen- 
dent variance are indicated by p level. Performance 
IQ accounted for significant variance in all outcomes. 
Children who were older and had later onset of deaf- 
ness had better reading scores. Children with later 
onset of deafness from smaller families with more 
educated parents tended to have better language when 
both speech and sign were considered together. 


Table 3 summarizes the added variance accounted 
for by implant characteristics once the variance due 
to child and family characteristics had been removed. 
Each of the implant variables contributed indepen- 
dent variance: duration of use of the spectral peak 
(SPEAK) processor (duration of implant use was not 
significant), number of active electrodes, dynamic 
range, and loudness growth. These variables ac- 
counted for 26% of additional explained variance in 
speech perception, 22% in speech production, 23% 
in spoken language, 21% in spoken and signed lan- 
guage, and 17% in reading. 


Table 4 summarizes the amount of added variance 
accounted for by educational variables after variance 
due to family, child, and implant characteristics had 


TABLE 2. CHILD AND FAMILY CHARACTERISTICS AFFECTING OUTCOMES 




















Speech 
Speech Speech Spoken and Sign 

Perception Production Language Language Reading 
Age p< .05 
Age at onset of deafness p<.05 p<.05 
Age at implantation 
Performance IQ p<.001 p<.0l p<.05 p< .05 p< .001 
Family size p<.05 
Parents’ education p<.05 ; 

11% 17% 20% 


Total variance 18% 15% 
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TABLE 3. IMPLANT CHARACTERISTICS AFFECTING OUTCOMES 














ES Speech 
Speech Speech Spoken and Sign 
Perception Production Language Language Reading 
Duration of SPEAK use p<.0l p<.0l p<.05 p<.01 p<.05 
No. of active electrodes p<.001 p<.05 p<.0l p<.05 p<.05 
Dynamic range p<.001 p<.0l p<.01 p<.0l p<.0l 
Loudness growth p<.05 p<.01 p<.01 p<.05 











Added variance 26% 22% 23% 21% 17% 


been removed. These variables accounted for 16% of 
the variance in speech perception, 18% in speech pro- 
duction, 10% in spoken language, 5% in spoken and 


brings to the task of learning to benefit from a co- 
chlear implant is good native intelligence as quanti- 
fied by a nonverbal measure. Once this variable is 





























< signed language, and 7% in reading. Use of an oral held constant, earlier age at implantation (for chil- 
communication mode contributed significant variance dren who undergo implantation before 5 years of age) 
to all outcomes except for spoken and signed lan- and later age at onset of deafness (for children who 
guage. The greater the emphasis on speech and audi- lose hearing before 3 years of age) are not associ- 
tory skill development, the better the outcome. “Type ated with better speech perception and production 
of classroom” reflects the amount of time spent in a skills. Although some period of normal hearing ap- 
mainstream class.’This variable was significant for pears to affect overall language and reading skill de- 
speech production, language, and reading. velopment, age at implantation has no such effect on 
Table 5 summarizes all outcomes combined to- any OREORE easily. This result suggests iati sense 
gether into a single overall outcome score. The vari- of urgency about performing implantation in infancy 
ables included in this analysis accounted for 54% of may De snnecessary, oral least dosere i SnEvAl: 
the variance outcome after implantation. Child and garon; 

- family characteristics (primarily IQ) accounted for Likewise, the studied characteristics of the family 
18%. Implant characteristics accounted for 24%, and into which the child is born do not seem to provide a 
educational factors (primarily communication mode) particular advantage or disadvantage (given that an 
accounted for 12%. adequate motivation for implantation and follow-up 

rehabilitation is present). The more highly educated 
DISCUSSION parents did not have children who achieved signifi- 
Children who undergo cochlear implantation be- cantly better outcomes after implantation (when the 
fore 5 years of age are presented with auditory infor-  child’s IQ was factored out). There was a tendency 
mation at a crucial time for speech and language de- for smaller families to have children who had some- 
velopment. A child’s ability to take advantage of this what better overall language development. 
information is the result of a variety of factors: those What the Implant Contributes. The overall func- 
Hab The che brings ne learning = SORRIER Is tioning of the cochlear implant predicts substantial 
those that are povided oy ihe implant itself, and those variance in all areas of outcome benefit. Although 
ad that T by me child's rehabilitation pros the overall duration of implant use (within the range 
gram. The results of this study help us to understand of 4 to 7 years) did not turn out to be an important 
the importance of each of these components. predictor of outcome, duration of use of the more 
What the Child Brings to the Learning Environ- recent SPEAK coding strategy had a substantial im- 
ment. The most important characteristic that the child pact in all outcome areas examined. As engineering 
TABLE 4. INDEPENDENT VARIABLES AFFECTING OUTCOMES 
Speech 
Speech Speech Spoken and Sign 
Perception Production Language Language Reading 
Hours of therapy 
Therapist experience 
Parent participation 

Bg School setting (public or private) 

Type of class (mainstream or special education) p<.05 p<.05 p<.01 
Communication mode p<.001 p< .001 p < .001 p<.05 
Added variance 16% 18% 10% 5% 7% 
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TABLE 5. VARIABLES AFFECTING OVERALL 











OUTCOME 
Predictor Variables Explained Variance 
Child and family 18% 
Implant 24% 
Rehabilitation 12% 
Total 54% 





improvements in speech processing are made, it is 
important that children gain access to the speech per- 
ception benefits provided as soon as they are avail- 
able. Because the field of electronic speech process- 
ing is evolving rapidly, upgrades in speech proces- 
sors may occur even more frequently. No child should 
be left with an earlier inferior processor, because the 
ability to benefit from upgrades is very apparent in 
these data. 


Better outcomes were associated with a larger num- 
ber of active electrodes in the cochlear implant MAP. 
The importance of the number of active electrodes 
should not be interpreted to suggest that more is al- 
ways better. All of the reported data were derived 
from the only implant device available when these 
children first underwent implantation (the Nucleus 
22-channel from Cochlear Corporation). Alternative 
cochlear implants that are currently available use dif- 
ferent speech processing strategies on different num- 
bers of electrodes. Ideally, the surgeon should make 
every attempt to insert a complete electrode array, 
and the audiologist should attempt to program as 
many electrodes as are inserted into the cochlea. 


The audiologist who programs the cochlear im- 





plant makes a particularly important contribution to 
the child’s potential successful outcome with the de- 
vice. Once the processing strategy and the number 
of active electrodes have been maximized, it is the 
audiologist’s role to achieve the most appropriate and 
balanced MAP possible for that child. A well-fitted 
MAP, as evidenced by a wide dynamic range and good 
growth of loudness, contributes substantially to posi- 
tive outcomes. This is a particularly important con- 
sideration in the MAP-ing of newly implanted in- 
fants, who may not be able to respond to traditional 
MAP-ing techniques. 


Whai the Rehabilitation Program Provides. The 
primary rehabilitative factor associated with perfor- 
mance outcome was educational emphasis on oral- 
aural communication. Communication mode was 
more important to auditory and spoken language de- 
velopment than any other rehabilitative factor exam- 
ined, including classroom placement (public or pri- 
vate; special education or mainstream), amount of 
therapy, experience of the therapist, and parent par- 
ticipation in therapy. The children whose educational 
programs emphasized dependence on speech and au- 
dition for communication were better able to use the 
information provided by the implant to hear, speak, 
and read. Use of sign communication with children 
with implants did not result in an advantage for over- 
all English language competence, even when the out- 
come measure included sign language. Oral educa- 
tion appears to be an important educational choice 
for children who have undergone cochlear implanta- 
tion before 5 years of age. 
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COCHLEAR IMPLANTS AND INFANTS: EXPECTATIONS AND 
OUTCOMES 
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This study followed the communication and motor development of 3 profoundly deaf infants who underwent cochlear implanta- 
tion. All infants were receiving auditory habilitation at the Sydney Cochlear Implant Centre, Australia, with the view to acquiring 
oral language skills and had English-speaking, hearing parents. Their progress was tracked with a battery of measurement tools 
before and after cochlear implantation. The results indicated noticeable growth in receptive language and motor skills after opera- 
tion. Although expressive language improved, the rate of growth was much slower than changes in other areas. Although some areas 
of the subjects’ communication skills remained delayed, the importance of tracking their progress according to their hearing age, as 


well as the chronological age, was evident. 


KEY WORDS — assessment battery, cochlear implant, communication, hearing age, infant, motor skill. 


INTRODUCTION 


With the advent of hearing screening programs for 
newborns, more infants with profound congenital 
hearing loss are being identified earlier. With this in- 
creased early identification, more infants are being 


put forward for cochlear implantation. Because im- ~ 


plantation in such young children is not internation- 
ally widespread, little is known about the effects of 
cochlear implantation on young children’s develop- 
ment, particularly their communication. A complica- 
tion associated with this current trend toward per- 
forming implantation in infants is the lack of appro- 
priately validated measures to evaluate communica- 
tion for infants with profound deafness. Indeed, it is 
difficult to ascertain what should be expected from 
these infants in the way of normal prelinguistic de- 
velopment, because of the limited information avail- 
able about communication and social skills in infants 
with profound hearing loss.! 


The small number of studies that have attempted 
to address communication and social skills in deaf 
infants have for the most part focused on only one 
aspect of communication or social skill development 
and used only single or small numbers of subjects.24 
Other research on infants born deaf have reported 
results for a wide variety of hearing levels.2+-7 Un- 
fortunately, such reporting makes it impossible to 
know how many of the infants had hearing losses 
that would be considered to be profound — a level 
of loss that has a tremendous impact on communica- 
tion development. Regardless of these methodologi- 
cal issues, however, investigators agree that a pro- 








found hearing loss in early childhood has a dramatic 
impact on the child’s development of communica- 
tion and social skills.2-+8 


In summary, there is limited information in the re- 
ported literature that documents the early communi- 
cation development of infants with hearing losses in 
the profound range, particularly for those who under- 
go cochlear implantation. Nor is there a recognized 
battery of evaluation procedures available in the lit- 
erature to evaluate communication skills in infants 
with profound deafness who are younger than 18 
months of age. This research aimed to document the 
oral communication development, before and after 
cochlear implantation, of 3 infants with a profound 
hearing loss who had an early diagnosis, were fitted 
with hearing aids early, and were seen for their first 
cochlear implant evaluation at or before 11 months 
of age. In addition, the relationships among commu- 
nication development, cognitive level, gross motor 
skills, and “hearing age” were examined. “Hearing 
age” was defined in terms of the date of hearing aid 
compliance, that is, when an infant was wearing hear- 
ing aids during all waking hours. 


METHODS 
SUBJECTS 


Subjects were selected for implantation on the ba- 
sis of a number of criteria. These included a bilateral 
profound hearing loss and hearing parents who used 
English as their first language. The infants also had 
to have begun candidacy evaluation for cochlear im- 
plantation at the Children’s Cochlear Implant Cen- 
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TABLE 1. BIOGRAPHICAL DATA a 
Subject] Subject2 Subject 3* 











Age at diagnosis 5 mo 7 mo | wk 
Cause LVAS Genetic Genetic 
Age fitted with hearing 

aids 7 mo 2 mo 7 mo 

Age at CI evaluation 11 mo 11 mo 8 mo 

Age at surgery 15 mo 15 mo 11 mo 

Age at CI activation 15 mo 15 mo 12 mo 

Age at 6-month ly9mo lyl0mo 1ly6mo 
assessment (ly2mo) (ly3mo) (1 y4mo) 
(hearing aget) 

Age at 12-month 2y4mo NA 2y 
assessment (| y 9 mo) (1 y 10 mo) 
(hearing aget) 

Device implanted Nucleus Nucleus Nucleus 

CI24M Cl24M CI24M 


CI — cochlear implant. 

*Infant’s corrected age (born 7 weeks premature). 

+Length of time hearing aids were worn during all waking hours 
before cochlear implantation. 





tre at 12 months of age or earlier and be enrolled in 
an oral, auditory-verbal educational setting. In addi- 
tion, the subjects needed to be judged as having nor- 
mal cognitive skills on the Bayley’s Scales of Infant 
Development.’ Finally, the infants had to be recom- 
mended for cochlear implantation. 


In 2 years, 7 infants who met these criteria pre- 
sented at the Children’s Cochlear Implant Centre. To 
date, preliminary data are available for the first 3 sub- 
jects (S1, S2, and S3) and will be presented in this 
paper. Biographical data for each subject are provided 
in Table 1. 


PROCEDURES 


During the children’s prelinguistic stage and tran- 
sition to the linguistic stage, data were derived with 
videotaping, developmental charts, questionnaires, 
and interviewing. The following battery of tools was 
used. 


Communicative Intents. The Communicative In- 
tention Inventory (CII)!° was used to track the devel- 
opment of each infant’s communicative intents. The 


parent and child were videotaped interacting for up 
to 15 minutes with a selection of toys. The frequency 
and types of communicative intents, as described in 
Table 2,!° were identified at each infant’s first ses- 
sion at the Centre and then monthly over the next 3 
months of the candidacy evaluation process. Inter- 
actions were videotaped at monthly intervals for 6 
months after implant activation and then once more 
at 12 months after activation. 


The CII suggests that 4 sets of toys be played with 
for 8 minutes each, but the recommended toys are 
not ideal for children younger than 12 months of 
age!!; therefore, the protocol was altered to accom- 
modate the infants. At the subject’s assessment 12 
months after cochlear implantation, the protocol as 
presented by Coggins and Carpenter! was used, be- 
cause the children were better able to cope with the 
complete procedure at this age. 


Consensus agreement between the first author and 
an independent rater was used for scoring all video- 
tapes.!? This approach was adopted, first, to reduce 
the possibility of all ratings’ being skewed in either 
direction (either too easy or too hard) and, second, 
to prevent any skewing of individual child ratings or 
individual item ratings.!3 The intrarater scoring agree- 
ments were 92.2% and 95.2% for frequency and type 
of communicative intents, respectively, for 8 random- 
ly selected videotapes. These scores indicated good 
intrarater reliability. !4-!5 


Speech. The Developmental Vocal Assessment as 
described in Table 3!6 was used to examine the area 
of vocalizations in the prelinguistic stage. The chil- 
dren’s vocalizations were phonetically transcribed by 
broad phonetic transcription from the videotaped in- 
teractions of the communicative intent samples. 


Language. Two procedures were used to evaluate 
the language level of the subjects. S1 and S3 were 
assessed by the Reynell Developmental Scales IH 
(RDS-III), which is designed to assess the language 
skills of children from 18 months to 7 years of age 
and comprises comprehension and expressive lan- 
guage scales.!7 $3 was unable to respond to the sim- 


TABLE 2. TYPES OF COMMUNICATIVE INTENT 








Type 


Description 





Comment on action 
Comment on object 
Request for action 
Request for object 
Request for information 
Answering 
Acknowledging 
Protesting 

Adapted from Coggins and Carpenter. !" 


Calls listener’s attention to movement of object rather than object per se 
Calls listener’s attention to object identified by child 

Intentional behavior that directs listener to make object move 

Intentional behavior directing listener to provide object, usually out of reach 


Intentional behavior that directs listener to provide information about object, action, or location 
Responding to request for information with semantically appropriate data 

Providing notice that previous gesture or utterance was received 

Expressing disapproval of speaker’s action or utterance 





N$ 
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TABLE 3. STAGES OF DEVELOPMENTAL VOCAL ASSESSMENT 














Vocal Stage Description 
Stage 1 (0 to 2 mo) Vegetative and neutral sounds; some vowel-like sounds 
Stage 2 (2 to 4 mo) Vowel sounds predominate; cooing, pleasure sounds 


Stage 3 (4 to 6 mo) 
variation in intonation 


Stage 4 (6 to 10 mo) 
Stage 5 (10 to 12 mo) 


Consonant-vowel combinations, greater variations in vowels, stop and nasal consonants, “raspberries,” 


Canonical babbling, variation of intonation, may hear first word 
Variegated babbling, consonant-vowel and consonant-vowel-consonant combinations with adultlike 


intonation, use of jargon and protowords, approximation of meaningful single words 


Adapted from Paul. !6 





plest items on the RDS-III, so the Preschool Lan- 
guage Scale—3 was used. !8 


Comparisons of Language and Motor Develop- 
ment. The Vineland Adaptive Behavior Scales were 
used in conjunction with the other objective assess- 
ments mentioned in order to evaluate the children’s 
language development and compare it with their mo- 
tor development.!? The Vineland Scale comprises a 
standardized parent questionnaire and is administered 
via parent interview. The questionnaires evaluate 
communication (receptive, expressive, and written 


Frequency 


Before Implantation 


A 


Fig 1. Frequency of communicative intents for 
A) subject 1, B) subject 2, and C) subject 3. 


Frequency 


Before Implantation 






Frequency 


Before Implantation 


Cc 





language) and motor skills (gross and fine motor). 
They were administered 3 times during the preopera- 
tive stage (Ist, 6th, and 12th weeks) and on 3 occa- 
sions after operation (3, 6, and 12 months). 


RESULTS 
COMMUNICATIVE INTENTS 
Before Implantation. Each of the subjects exhib- 
ited nonverbal communicative intents at all 4 data 
collection points before implantation. However, of 
particular interest was the fact that before implanta- 
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tion, no verbal communicative intents were recorded 
for any of the subjects. This is shown in Fig 1. Al- 
though all infants used all types of nonverbal com- 
municative intents as outlined in Table 2, Comment 
on Action or Object and Request for Action or Ob- 
ject were used noticeably more than the other 4 types. 


After Implantation. Verbal communicative intents 
emerged 3 to 4 months after device activation for 
each of the subjects. This emergence of verbal com- 
municative intents occurred regardless of chronologi- 
cal age (ie, S1 at 18 months, S2 at 20 months, and S3 
at 16 months). By 6 months after implantation, an 
overall growth was evident in use of both nonverbal 
and verbal communicative intents, suggesting a gen- 
eral growth in communicative intent. At this stage, 
S1, S2, and S3 exhibited Comment on Action or Ob- 
ject and Request for Action. S1 and S2 also exhib- 
ited Acknowledging. 


Twelve Months After Implantation. A noticeable 
increase in verbal communicative intents was evident 
in both S1 and S3. At this stage, S1 demonstrated all 
types of verbal communicative intents with the ex- 
ception of Protesting. S3 also used all types except 
Request for Information and Acknowledging. S2 was 
not available for her annual assessment because of 
illness. 


Hearing Age. The data were also analyzed by ad- 
justing each subject’s age to the length of time they 
had been wearing hearing aids rather than their chro- 
nological age. This type of analysis revealed the de- 
velopment of verbal communicative intents to age- 





Fig 2. Speech development shown by A) chro- 
nological age and B) hearing age. 


appropriate for each of the subjects (ie, S1 at 11 
months, S2 at 13 months, and S3 at 14 months of 
hearing age). 

SPEECH 


For the videotaped interactions, each subject’s vo- 
calizations were recorded and marked on the Vocal 
Development Assessment. This form divides an in- 
fant’s vocalization type into 5 prelinguistic stages 
that are described in Table 3. The results were as fol- 
lows and as displayed in Fig 2A. 


At the first data collection point (before implanta- 
tion), all subjects had attained stage 1 vocalizations. 
This was at 11 months of age for S1 and S2, and at 8 
months of age for S3. Stage 2 vocalizations were 
achieved by S2 at 17 months and by S3 at 8 months. 
No stage 2 vocalizations were recorded for S1. Stage 
3 vocalizations were achieved by S1 at 12 months, 
by S2 at 13 months, and by S3 at 14 months. Stage 4 
vocalizations were achieved by S1 at 18 months, by 
S2 at 20 months, and by S3 at 15 months. Stage 5 
vocalizations were achieved by S1 at 18 months, by 
S2 at 20 months, and by S3 at 16 months. First words 
were achieved by S1 at 19 months, by S2 at 22 months, 
and by S3 at 17 months. 


The vocalization data were analyzed by adjusting 
the subjects’ ages to the length of time they had been 
wearing aids rather than their chronological age. The 
different stages of vocalization acquisition can be 
seen in Fig 2B. The following age equivalents refer 
to hearing age. 


Stage | was achieved by S1 and S2 at 4 months 


TABLE 4. LINGUISTIC ASSESSMENT 

















Subjeci 1 Subject 2 
Test Receptive Expressive Receptive Expressive 
Reynell Developmental Scales III 2y 1mo NA <l y 9 mo <1 y 9 mo 
Preschool Language Scale—3 NA 12 mo 1 y 6 mo 
Vineland Adaptive Behavior Scales 3 y 9 mo 1 y 8 mo 1 y 10 mo l1 y 4 mo 





Wright et al, Cochlear Implants in Infants 135 


Age (months) 


> 


Fig 3. Scores on Vineland Adaptive Behavior 
Scales for A) subject 1, B) subject 2, and C) sub- 
ject 3. PRE — preimplantation data collection 
point. 


Age (months) 


Age (months) 


and by S3 at 6 months. Again, this was this first data 
collection point for these infants. Stage 2 was achieved 
by S2 at 10 months, and by S3 at 6 months; stage 2 
was not evident in S1. Stage 3 was achieved by S1 at 
5 months, by S2 at 7 months, and by S3 at 12 months. 
Stage 4 was achieved by S1 at 11 months, by S2 at 
13 months, and by S3 at 13 months. Stage 5 was 
achieved by S1 at 11 months, by S2 at 13 months, 
and by S3 at 14 months. First words were achieved 
by S1 at 12 months, by S2 at 15 months, and by S3 
at 15 months. 

LANGUAGE 


A standardized linguistic assessment was admin- 
istered 12 months after cochlear implantation. A sum- 
mary of the data is presented in Table 4. One year 
after cochlear implantation, S1 was able to cope lin- 
guistically with the instructions and questions from 
the Reynell Developmental Scales III. She attained 
a receptive language score of 2 years 1 month. Ac- 
cording to the Communication Scale of the Vineland 
Adaptive Behavior Scales, she had a receptive lan- 


PRE 1 PRE3 


PRE 1 PRE 3 


© Chronological Age 
O Hearing Age 





3 Months 6 Months Annual 
After After 





3 Months 6 Months Annual 
After After Assessment 





PRE 1 PRE 3 3 Months 6 Months Annual 
After After Assessment 
Stage 


guage age of 3 years 9 months and an expressive 
language age of | year 8 months. 


S3 was unable to complete the Reynell Develop- 
mental Scales III, as she had raw scores of 0 and 2 
for comprehension and expression, respectively, 
equating to a language age of less than | year 9 
months. Therefore, her skills were tested with the Pre- 
school Language Scale—3, which placed her at 12 
months and 1 year 6 months for comprehension and 
expression, respectively. According to the Vineland 
Adaptive Behavior Scales, her receptive language 
skills were at 1 year 10 months, and her expressive 
language at | year 4 months. 


As mentioned above, S2 was not available for her 
annual assessment because of illness. 


COMPARISON OF MOTOR DEVELOPMENT AND 
LANGUAGE DEVELOPMENT 


Using the Vineland Adaptive Behavior Scales, we 
calculated each subject’s development in the area of 
communication and motor development from the par- 
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ent questionnaire. For this analysis, we compared re- 
ceptive language, expressive language, and motor age 
equivalents, results for which are charted in Fig 3. 


Before Implantation. For the 3 subjects, all areas 
of receptive language, expressive language, and mo- 
tor development were below their chronological age. 


After Implantation. The data indicated that the mo- 
tor and receptive language skills were at or above 
age level for S1 and S2 by 3 months after cochlear 
implantation and for S3 by 6 months after implanta- 
tion. Expressively, all 3 subjects remained below their 
chronological age. 


Hearing Age. When hearing age was calculated in 
terms of the time since the beginning of hearing aid 
compliance, expressive language skills for S1 and S2 
improved and were age-appropriate at all data points 
after implantation. However, S3’s expressive lan- 
guage skills remained below her calculated hearing 
age. 


COMPARISON OF MOTOR AND COMMUNICATION 
DOMAINS 


The 2 domains investigated were communication 
(receptive, expressive, and written language) and mo- 
tor development (gross and fine). They were calcu- 
lated in terms of standard scores and are shown in 
Fig 4. 


Before Implantation. Both S2 and S3 were within 
the normal range for communication; however, S1 
was below average. Motor development for all 3 sub- 
jects was in the low average range. 


After Implantation. All subjects displayed a large 
improvement in motor skills. It was interesting to note 
that during this period of rapid motor skill growth, 
their communication levels decreased and, in fact, 
fell below the average range. 





12 month 





12 month 


Fig 4. Scores on domains of Vineland Adaptive Behavior Scales 
for A) subject 1, B) subject 2, and C) subject 3. 
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DISCUSSION 


Findings from this small sample would suggest that 
infants who undergo cochlear implantation show im- 
provement in their communication and motor skills. 
All developed verbal communicative intents 3 to 4 
months after cochlear implantation, and all experi- 
enced enormous growth in receptive language and 
motor skills. Expressive language skills also im- 
proved after implantation, but not at such a rapid rate. 


Communicative Intents. After implantation, the 
subjects used most types of communicative intents. 
Regardless of age, verbal communicative intents 
emerged 3 to 4 months after cochlear implantation 
in all subjects. This finding has implications for chil- 
dren who undergo implantation at an earlier age than 
those in this study, such as at 8 to 9 months of age, as 
it might promote the emergence of first words and 
verbal communicative intents at 12 months of chro- 
nological age. If this were the case, it could effec- 
tively eliminate one of the early gaps between the ex- 
pressive language development of profoundly hear- 
ing-impaired infants and that of infants born with nor- 
mal hearing. 


Speech. It appeared that the early stages of speech 
development for deaf infants were achieved in the 
same order as in hearing infants. However, these 
speech skills are delayed as compared to their chro- 
nological age. This supports the findings of previous- 
ly published research.%3-6-8 Although their develop- 
ment was delayed in terms of chronological age, it 
was commensurate with their hearing age. This re- 
sult supports the body of evidence that recommends 
early identification and intervention for infants with 
congenital deafness.2-?? 


Language. At 12 months after cochlear implanta- 
tion, SI’s receptive language skills were commen- 
surate with her chronological age. Her expressive lan- 
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guage was below her chronological age but on par 
with her hearing age. S3’s expressive language was 
also commensurate with her hearing age, but her com- 
prehension had slowed and was equal to her cochlear 
implant age. This was considered unusual, as it is 
rare for children to develop expressive language skills 
ahead of their comprehension level. 


Comparison of Motor Development and Language 
Development. All 3 subjects had delayed gross mo- 
tor skills before cochlear implantation. After implan- 
tation, 2 subjects had moved into the expected range 
by 3 months, and 1 subject by 6 months. Although 
this large increase in the area of motor development 
occurred, language development decreased, as shown 
in Fig 4. This finding may have implications for 
monitoring the progress of children after implanta- 
tion, in particular those who have significant motor 
problems and developmental delays. 


SUMMARY 
These findings support the large body of research 


that advocates the importance of early identification 
of hearing loss and intervention, and confirm the im- 
portance of hearing aids and early development of 
audition even for those children with minimal hear- 
ing. However, the provision of cochlear implants to 
deaf infants seemed to assist these infants in acquir- 
ing several aspects of oral language, although addi- 
tional data are required to support this finding. This 
investigation also highlights the need for continued 
intensive habilitation after cochlear implantation. Al- 
though all 3 subjects’ communication skills improved 
after device implantation, some delays remained. The 
battery of tests described in this paper has proven 
valuable in tracking prelinguistic to linguistic devel- 
opment and assessing the impact of motor develop- 
ment. The importance of using the infant’s “hearing 
age” as a guide to tracking adequate progress cannot 
be ignored. However, before stronger conclusions can 
be drawn, a larger subject sample is required. Fur- 
ther data will be collected as infants continue to be 
referred for cochlear implantation at the Sydney Co- 
chlear Implant Centre, Australia. 
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CHANGING EXPECTATIONS FOR CHILDREN WITH COCHLEAR 
IMPLANTS 


JEAN SACHAR MOOG, MS 
ST LOUIS, MISSOURI 


Seventeen students with cochlear implants who were between 5 and 11 years of age and attended the Moog Center for Deaf 
Education school program were tested just before exiting the program. The Moog program is an intensive oral program that provides 
very focused instruction in spoken language and reading. Children leave the program when they are ready for a mainstream setting or 
when they are 11 years of age, whichever comes first. All of the children demonstrated open-set speech perception ranging from 36% 
to 100%. On a test of speech intelligibility, all students scored 90% or better. On language and reading tests, compared with the 
performance of normal-hearing children their age, more than 65% scored within the average range for language and more than 70% 
scored within the average range for reading. These data demonstrate what is possible for deaf children who benefit from a combina- 
tion of a cochlear implant and a highly focused oral education program. 


KEY WORDS — cochlear implant, deafness, elementary education, hearing, mainstreaming, oral deaf education. 


INTRODUCTION 


Children’s performance with cochlear implants has 
come a long way since the US Food and Drug Ad- 
ministration (FDA) first approved this device for deaf 
children a little more than a decade ago. Ten years 
ago, the goal of the implant manufacturers, the im- 
plant surgeons, and those involved in teaching deaf 
children was to demonstrate “some benefit” over 
what prelingually profoundly deaf children received 
from hearing aids. Our focus was primarily on devel- 
oping some closed-set speech perception and improv- 
ing phonetic-level speech production skills.! The Ear- 
ly Speech Perception Test? (ESP) was developed as 
a tool to document progress and ranged from differ- 
entiating words and phrases on the basis of their du- 
ration and rhythmic pattern (pattern perception) to 
differentiating monosyllables that differ in their vow- 
els, such as boat, bat, and boot (vowel perception). 
It was anticipated that those who were especially suc- 
cessful might demonstrate consonant perception by 
differentiating among rhyming words (eg, boat, coat, 
goat, note). Closed-set tests such as the Word Intelli- 
gibility by Picture Identification Test? (WIPI) were 
considered to present a most difficult challenge to 
implant users. In the beginning, open-set recognition 
at a level much above chance was not considered a 
realistic goal for prelingually, profoundly deaf chil- 
dren with cochlear implants. 


Now, 10 years later, our expectations have been 
raised substantially. The improved technology of co- 
chlear implants has had an impact on teaching meth- 
ods, and both have had a dramatic impact on deaf 
children’s progress. The progress of children with 


implants in the areas of speech perception, speech 
production, language, and reading has far exceeded 
the expectations of even the most optimistic. Deaf 
children with cochlear implants are learning to talk 
faster and better than anyone anticipated 10 years 
ago. The reasons for these changing expectations are 
varied and are associated with improvements in sev- 
eral areas. 


Improved Technology. The processing strategies 
of all 3 current FDA-approved devices are far supe- 
rior to the strategies available in 1990. Considerably 
more speech information is provided to the listener. 
Furthermore, improvements in internal device de- 
signs now provide more rapid delivery of informa- 
tion with smaller amounts of current. 


Improved Audiology. Schools and implant centers 
are demonstrating increased awareness of the impor- 
tance of the audiologist to the success of children 
with implants. In order for children to fully capital- 
ize on improved devices and coding capabilities, they 
must have access to audiologists who have the skills 
necessary to work with young children and to pro- 
gram their processors for maximum benefit in terms 
of discriminating speech. The quality of the child’s 
MAP affects the quality of what the child hears and, 
consequently, how the child learns to talk. There is a 
growing awareness that children’s implant MAPs 
need to be monitored regularly and adjusted imme- 
diately if the child, parent, or teacher recognizes a 
problem. Teachers are beginning to know the child’s 
abilities well enough to identify when the MAP may 
need adjusting, recognizing subtle signs, such as that 
the child’s speech is less clear, or that the child is 
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having more trouble understanding than is typical, 
perhaps saying “what?” more than usual. 


Improved Teaching. Providing a good MAP for a 
child is a crucial first step in improving the child’s 
potential for learning to talk. However, equally im- 
portant is the kind of spoken language instruction 
the child receives. Even though the implant itself is 
making it easier for deaf children to hear speech, what 
profoundly deaf children hear is not sufficient for 
them to learn to talk without specific instruction. In 
response to this need, competent teachers of the deaf 
are acquiring the knowledge and the skills necessary 
for training the child’s listening skills and capitaliz- 
ing on what the child hears through the implant for 
improving spoken language. 


In quality oral education programs, skilled and ex- 
perienced teachers provide direct teaching to accel- 
erate the development of spoken language. They en- 
gage children in specific lessons and capitalize on 
natural situations that occur to teach spoken language. 
This direct, focused instruction helps accelerate the 
learning of deaf children so that they can eventually 
catch up to their normal-hearing peers and begin 


_ learning in a regular educational setting from teach- 


ers without such specialized training. 


The initial emphasis is placed on helping the child 
to listen and develop auditory comprehension, be- 
cause the easiest way for a child to learn to talk is 
through what he or she hears. However, when chil- 
dren listen to the teacher’s corrections of their pro- 
ductions through an implant, they can also improve 
their own productions. Furthermore, when children 
are receptive to imitating the teacher’s model, they 
can learn to produce longer, more complex, and/or 
more precise sentences, and they can improve the 
clarity of their speech. 


In addition to making sure the child is listening to 
both himself or herself and the teacher, the teacher 
must actively listen to the child, as well. Teachers 
listen first to understand the child’s idea and respond 
to it. However, teachers listen not only to what the 
child says, but also to how he or she says it. Teach- 
ers listen carefully to the correctness of the vocabu- 
lary and syntax and the clarity of speech. Every time 
the child talks, the teacher has an opportunity and a 
responsibility to help the child talk better. A tech- 
nique of modeling and imitation helps improve the 
child’s production. The teacher models an improved 


version of what the child said, highlighting some as- ` 


pect for improvement, and the child is expected to 
imitate that model. The child’s imitated production 
helps the child in comprehension, provides practice 
in producing the added element, and provides feed- 


back to the teacher regarding the child’s articulation, 
vocabulary, and syntax. The child’s errors and omis- 
sions provide guidance on what to correct and what 
to teach next. 


When teachers listen attentively, children like to 
talk, know they are expected to talk, and talk often. 
The more they talk, the better they get at talking. 
When children are taught in small groups of 2 or 3, 
they have more turns to talk, and teachers can give 
them the time to talk. In addition to getting more prac- 
tice talking, the children also stay more engaged in 
the activity and have the desire to talk even more. 


Improved Educational Programs. Educational pro- 
grams for children with cochlear implants are plac- 
ing an increasing emphasis on the development of 
spoken language. The most successful programs are 
based on the belief that deaf children learn to talk by 
talking. In these programs, children are immersed in 
spoken language and are dependent on talking to com- 
municate. Although learning to listen is important, 
listening is not enough. For deaf children, produc- 
ing spoken language may not automatically follow 
comprehension as it does for normal-hearing chil- 
dren. A focus on talking with many opportunities to 
do so accelerates progress in learning to talk. 


Successful programs also provide highly individu- 
alized instruction. A teacher-pupil ratio of at least 1 
to 4 makes it possible for teachers to know precisely 
the current skill levels of the children they teach. 
Small group instruction makes it possible for skilled 
teachers to gear instruction to be at just the right lev- 
el for each child to have maximum challenge and 
achieve maximum success. 


Children with implants also benefit from programs 
that provide specific teaching objectives. A sequen- 
tial list of skills in their expected order of develop- 
ment in speech perception,’ speech production,’ and 
language®’ guides teachers in selecting objectives. 
Teachers can identify which skills the child has ac- 
quired and choose objectives for teaching from the 
next level of skills. Evaluation of the child’s perfor- 
mance, during and after each lesson, lets the teacher 
know how well the child is progressing toward the 
targeted objectives. 


Improved Family Support. As the use of cochlear 
implants becomes more widespread, parents are re- 
ceiving more training in supporting their child’s au- 
ditory and speech development. Parents demonstrate 
support by keeping their child’s cochlear implant in 
good working order and identifying when there is a 
problem, by encouraging and helping their child to 
talk, and by being an advocate for their child in all 
situations. 
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SUBJECT CHARACTERISTICS 











Range Mean 
Age at implantation 2y5moto7y8 mo 4y3mo 
Duration of implant use ly6moto7y3mo 4y6mo 


Age at exiting and/or 
testing 5y9motolly9mo 8y9 mo 
Age of diagnosis for 12 
congenitally deaf 


children 6 mo to 2 y 6 mo l y 3 mo 
Onset of deafness for 5 

children deafened by 

meningitis 7 mo to 3 y | mo 2y6mo 
Age receiving oral 

education services lylmoto3y5mo 2y5mo 
Performance IQ 90 to 121 108 





Parents are as important as teachers in terms of 
helping their children capitalize on their implants and 
acquire spoken language. As parents gain an under- 
standing of the importance of the implant in their 
child’s learning to “hear” and talk, they become con- 
scientious in monitoring their child’s implant, mak- 
ing sure it is working as it should and getting any 
problems fixed. They also encourage and support 
their child by using spoken language in their home, 
and work with their child to practice the spoken lan- 
guage skills learned at school. Parents learn to ad- 
just their language to a level the child can under- 
stand and learn strategies to 1) help their child un- 
derstand new and more difficult language, 2) encour- 
age their child to talk, and 3) expand their child’s 
spoken communication skills. Because parents ex- 
pect their child to communicate with speech and pro- 
vide the support to achieve this, their child contin- 
ues to learn and practice talking at home as well as 
at school. 


NEW LEVELS ACHIEVED UNDER OPTIMAL 
CIRCUMSTANCES 


For the past 5 years, children enrolled at the Moog 
Center for Deaf Education have experienced all of 
the circumstances described above. The Moog School 
provides a full-day special education program for the 
first few years after cochlear implantation. Children 
leave this program whenever their language and read- 
ing scores are at grade level compared with their age, 
and they are ready for a mainstream class. This can 
be anywhere from kindergarten to fifth grade. When 
children are 11 years old, they leave the school re- 
gardless of whether their language and reading abili- 
ties are at grade level or not. 


Subjects. The 17 subjects included in this sample 
are all of the children with cochlear implants who 
left the school in the past four years, 1997 through 
2000. The sample characteristics at the time the chil- 
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Subject 


Fig 1. Percent correct open-set speech perception scores 
on Phonetically Balanced—Kindergarten (PBK) word list 
for 17 children. 


dren left the school are summarized in the Table. At 
the time of testing, the children ranged in age from 5 
years 9 months to 11 years 9 months. The age at im- 
plantation ranged from 2 years 5 months to 7 years 8 
months, and the duration of implant use ranged from 
l year 6 months to 7 years 3 months. Performance 
IQ scores averaged 108, ranging from 90 to 121, as 
measured by the Wechsler Preschool and Primary 
Scale of Intelligence—Revised® or the Wechsler In- 
telligence Scale for Children HI.’ Twelve of the chil- 
dren were deaf from birth, and 5 were deafened by 
meningitis — 3 before | year of age, 1 at 2!/2 years 
of age, and | at 3 years of age. Those deaf at birth 
received that diagnosis relatively early, ranging from 
6 months to 30 months of age; the average age of 
diagnosis was 15 months. Fourteen of the children 
were enrolled in an oral education program from the 
start, and the other 3 were enrolled in an oral pro- 
gram by 3 years of age. 


Test Battery. A battery of tests was administered 
to evaluate these children in the areas of speech per- 
ception, speech production, language, and reading. 
The test scores reported are from the last testing be- 
fore the children left the Moog Center special edu- 
cation program to enter mainstream classes. 


Percent Correct 
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Fig 2. Percent correct speech intelligibility scores on Pic- 
ture Speech Intelligibility Evaluation (Picture-SPINE) 
speech production test for 17 children. 
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Fig 3. Average language quotient scores across 3 lan- 
guage measures (Peabody Picture Vocabulary Test, Ex- 
pressive One-Word Picture Vocabulary Test, Clinical 
Evaluation of Language Fundamentals) for 17 children. 
Horizontal line indicates average language quotient for 
normal-hearing children. 


Speech Perception. Children were tested with the 
Phonetically Balanced—Kindergarten (PBK)!° word 
lists, an open-set test that requires the child to listen 
to presentations of 1-syllable words and repeat what 
is heard. There are no pictures of the words from 
which to choose. A 25-word list was administered, 
auditory only, in the sound field at 70 dB sound pres- 
sure level, by means of recorded speech. The results 
plotted in Fig | represent individual scores ordered 
from the lowest score to the highest score. The PBK 
scores ranged from 36% to 100%, with 88% of the 
sample scoring more than 50% correct. 


Speech Intelligibility. The Picture Speech Intelli- 
gibility Evaluation (Picture-SPINE)!! is designed to 
measure the intelligibility of the speech of profoundly 
deaf children. Four sets of 50 pictures (2 each of 25 
different items) are given to the child 1 at a time. 
The examiner does not see the pictures, but has a list 
of the 25 possible words depicted in the set. As the 
child names each picture, the examiner writes down 
what is said. The percentage of pictures correctly 
identified by the examiner is the child’s score. All of 
the words exemplify specific phonemic contrasts 
known to be particularly good predictors of overall 
speech intelligibility among deaf children. The scores 
on this test correlated at .90 or better with speech 
intelligibility scores obtained from speech samples 
played to listeners, indicating that scores on the Pic- 
ture-SPINE are highly predictive of speech intelligi- 
bility. Figure 2 depicts the individual speech intelli- 
gibility scores of the 17 subjects. 


Language. A language score was calculated by av- 
eraging the scores on a receptive vocabulary test, an 
expressive vocabulary test, and a more general lan- 
guage test (Fig 3). All of these measures yield age 
and quotient scores based on the performance of nor- 
mal-hearing children. The tests in the language bat- 
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Fig 4. Reading quotient scores (Gates MacGinitie Read- 
ing Test or Stanford Achievement Test) averaged for vo- 
cabulary and comprehension for 17 children. Horizontal 
line indicates average reading quotient for normal-hear- 
ing children. 


tery are listed below. 


The Peabody Picture Vocabulary Test (PPVT; third 
edition)!? is an individually administered test of re- 
ceptive vocabulary, in which the examiner says a word 
and the child is expected to look at a set of 4 pictures 
and point to the picture that illustrates the word. The 
Expressive One-Word Picture Vocabulary Test 
(EOWPVT)!3 is an individually administered test de- 
signed to evaluate expressive vocabulary in which 
the examiner shows the child a series of pictures | at 
a time and the child is expected to name them. The 
third edition of the Clinical Evaluation of Language 
Fundamentals (CELF-3)!4 is an individually admin- 
istered test that is divided into 6 subtests, 3 of which 
assess receptive language skills and 3 of which as- 
sess expressive language skills. 


Reading. Reading skills were measured by | of 2 
standardized reading tests. Both of these are group- 
administered paper-and-pencil tests and yield grade- 
equivalent scores based on the scores of normal-hear- 
ing children. These grade scores are converted to age 
scores and divided by the child’s chronological age 
at the time of testing to yield a quotient score repre- 
senting that child’s reading status in relation to nor- 
mal-hearing age-mates. A child reading precisely at 
age level would score 100, similar to an IQ score of 
100. The individual reading quotient scores are shown 
in Fig 4. 

The Gates MacGinitie Reading Test (third edi- 
tion)!5 was used with children younger than 8 years 
of age. The vocabulary and comprehension subtests 
at this level consist of simple vocabulary and com- 
prehension of a few critical elements in sentences. 


The Stanford Achievement Tests (ninth edition)!® 
was administered to children 8 years of age and older. 
The vocabulary and reading comprehension subtests 
require increasing inference and complex language 
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skills as the grade level increases. 


RESULTS 


In the areas of speech perception and speech in- 
telligibility, absolute levels achieved were measured. 
Language and reading skills were compared to those 
of normal-hearing age-mates. 


All of the students demonstrated open-set word 
recognition, as measured on a PBK word list, and all 
but 2 scored better than 50% correct on this test. All 
students scored 90% or better on the Picture-SPINE, 
which measures the intelligibility of their speech. A 
score of 90% is described in the Picture-SPINE test 
manual as “excellent intelligibility: naive listeners 
can understand most of the child’s speech at first in- 
troduction.” 


Language was compared to that of hearing age- 
mates by means of the PPVT and the EOWPVT for 
vocabulary and the CELF-3 for more general language 
skills. When quotient scores were averaged across 
the 3 measures, more than 65% of the children scored 
within the average range. Reading was measured with 
the Gates-MacGinitie Survey for children younger 
than 8 years of age and the Stanford Achievement 
Tests for children 8 years of age and older. More than 
70% of the students scored within the average range 
for their age. 


CONCLUSION 


These levels of achievement for prelingually pro- 
foundly deaf children with cochlear implants far ex- 
ceed the expectations of just a decade ago. At that 
time, only a very small percentage of children with 
cochlear implants achieved even 12% open-set speech 
perception.! Although it was expected that improved 
speech perception would result in improvements in 
speech intelligibility, it was not expected that chil- 
dren could achieve 90% to 100% intelligibility. It 
was also hypothesized that improvements in percep- 
tion would eventually result in improved levels of 
language and reading, but such a profound impact 
was not expected. 


These data demonstrate what is now possible for 
deaf children who benefit from a combination of a 
cochlear implant and optimum intervention in an 
ideal school program, and establish a new standard 
of achievement for deaf children. Many, if not most, 
of the characteristics described could be implemented 
in programs for children with cochlear implants. Pro- 
viding such programs, although expensive, is re- 
quired only for a short time, after which a majority 
of children will be put into mainstream programs with 
skills comparable to those of their hearing age-mates. 
For deaf children, achieving this level of performance 
will affect everything about their future lives — so- 
cially, academically, and economically. 
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background to establish an independent voice disorders pro- 
gram at his or her home institution. There are numerous 
clinical and basic science research opportunities available as 
well as the option to work collaboratively with local music 
conservatories. Certification by the American Board of Oto- 
laryngology or eligibility is a prerequisite. 


Direct inquiries to: Steven M. Zeitels, MD, Director, Division 
of Laryngology, Massachusetts Eye and Ear Infirmary, 
243 Charles Street, Boston, MA 02114; 
Phone: (617) 573-3557, Fax: (617) 573-3068 
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Courses on Middle Ear and 
Lateral Skull Base Surgery 


Course Director: 
Mario Sanna, MD 

This cadaver hands-on workshop provides 
the individual opportunity to participate in: 
e Live surgery every morning 
e Video presentation of techniques and clini- 

cal cases 
e Didactic presentations 
e Performing the surgical procedures on 

specially prepared bones and half-heads 
* Discussions 


Skull Base Surgery, October 21-25, 2002 
Middle Ear Surgery, November 18-22, 2002 


For additional information, contact: 


Secretary Gruppo Otologico 
Via Morigi, 41 
29100 Piacenza, Italy 
telephone: 39-0523-754362 
fax: 39-0523-453708 
E-mail: Corsi.Sanna@ gruppootologico.it 
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KAISER PERMANENTE 
More people turn to us for good health. 


DENVER, COLORADO 
OTOLARYNGOLOGY 


The Colorado Permanente Medical Group 
is seeking a Board-eligible/certified otolar- 
yngologist for a stimulating referral prac- 
tice in a prepaid group setting. Salaried po- 
sition with excellent benefits. Denver offers 
abundant four-season outdoor activities en- 
hanced by many community amenities. 


Contact: 


Physician Recruitment 
Colorado Permanente Medical Group 
10350 E Dakota Ave 
Denver, CO 80231 
Phone: 303-344-7838 
Fax: 303-344-7818 


EOE,M/F,V/H 





SOUTHERN CALIFORNIA 


Magan Medical Clinic, Inc, alarge, indepen- 
dent, physician-owned multispecialty group 
serving the east San Gabriel Valley for over 
80 years, is seeking a Board-eligible or 
Board-certified otolaryngologist. The suc- 
cessful candidate will practice in an excel- 
lent setting with a mix of fee-for-service and 
HMO patients. The position is full-time. In 
return for your expertise, we offer an attrac- 
tive compensation package with outstand- 
ing benefits, including a generous 401K. 


Qualified candidates are invited to apply by 
submitting a CV and letter of interest to: 


Magan Medical Clinic, Inc 
Attn: Human Resources 
420 W Rowland St 
Covina, CA 91723 
telephone: (626) 331-6411 
fax: (626) 251-1550 
e-mail: recruit@maganclinic.com 
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NEW-GENERATION PULSED CARBON DIOXIDE LASER: 
COMPARATIVE EFFECTS ON VOCAL FOLD WOUND HEALING 


C. GAELYN GARRETT, MD 


Lou REINISCH, PHD _ 


NASHVILLE, TENNESSEE 


We investigated wound healing of vocal fold mucosal excisions using a canine model, comparing cold steel, the continuous 
wave (CW) carbon dioxide (CO2) laser, and a new, microsecond-pulsed CO2 laser. The thermal injury to the surrounding normal 
tissues produced by the laser increases the risk for scar formation and poor functional voice outcome. This injury may be reduced 
with a pulsed CO2 laser. Acute, 2-week, and 6-week studies of 15 dogs were made with blinded observers and nonparametric statis- 
tical analyses. Histologic comparison showed less thermal injury and scar formation in the vocal folds treated with the pulsed CO2 
laser than with the CW CO? laser. Functional outcomes as studied with laryngeal videostroboscopy revealed better vibratory charac- 
teristics with the pulsed CO2 laser as compared with the CW COz laser. No differences were seen on these measures between the 
pulsed laser and cold techniques. Pulsed CO2 lasers may offer the ease of laser laryngology and the superior wound healing of cold 


steel. 


KEY WORDS — laser surgery, vocal fold, wound healing. 


INTRODUCTION 


When presented with a benign non-neoplastic vo- 
cal fold lesion such as a polyp or cyst, the laryngolo- 
gist has both cold and laser techniques to choose from 
when the decision for surgical excision is made. Both 
have advantages and disadvantages. The major disad- 


` vantage of currently available carbon dioxide (CO2) 


laser techniques is the associated thermal damage to 
surrounding normal tissue and the resulting effect 
on wound healing.!3 This alone has led to the deci- 
sion by most laryngeal surgeons to use cold nonlaser 
techniques to optimize functional voice outcome. The 
major disadvantage of cold microlaryngeal tech- 
niques is the inherent length of the moment arm re- 
quired to reach the vocal folds through the laryngo- 
scope. The CO2 laser delivered through a microspot 
micromanipulator attached to an operating micro- 
scope effectively eliminates the moment arm, theoret- 
ically minimizing operator error in making incisions. 
Thus, a laser offering the advantages of clinically 
available CO2 lasers but associated with less sur- 
rounding thermal injury would be ideal for microla- 


. ryngeal surgery. Various modifications to the CO2 


laser have been made with this goal in mind, includ- 
ing the use of reduced spot size, computer-controlled 
beam delivery, and variations in the energy pulse du- 
ration and repetition rate. Although improvements 


_ have been made, the wound healing remains less opti- 


mal than with cold techniques. 


The laser pulse structure has been shown to influ- 
ence the degree of thermal damage to tissue. Most 
CO? lasers in current clinical use are continuous wave 
(CW) lasers. Most microlaryngeal applications use 
the laser in a mechanically shuttered mode, common- 
ly referred to as a pulsed mode, using either single 
“pulses” or repeat “pulses.” This is somewhat of a 
misnomer, in that the laser beam structure is not truly 
pulsed. Walsh et al‘ first reported on the advantages 
of a short-pulsed CO2 laser in reducing thermal in- 
jury as compared to a CW CO2 laser in a shuttered 
mode. More recent studies have corroborated these 


findings." The pulsed lasers take advantage of ther- . 


modynamic tissue parameters known as thermal re- 
laxation time and thermal recovery time. The pulse 
structure is such that thermal diffusion is minimized, 
lessening thermal injury to adjacent tissue. A recent 
study by Sanders and Reinisch® evaluated wound 
healing of incisions in rat dermis using a prototype 
CO2 laser with a 7.5-us pulse duration. This short- 
pulsed CO2 laser resulted in comparable healing as 
compared to scalpel incisions and improved healing 
as compared to incisions made with a CW CO? la- 
ser. The human vocal fold, with its unique lamina 
propria structure, is especially at risk for potential 
functional effects of scar caused by deep thermal in- 


jury. 





From the Department of Otolaryngology, Vanderbilt Bill Wilkerson Center for Otolaryngology and Communication Sciences, Vanderbilt Uni- 
versity Medical Center, Nashville, Tennessee. Supported by a grant from the American Laryngological Association. The pulsed laser was 
provided on a no-cost loan from Argus Photonics Group, Jupiter, Florida. This study was performed in accordance with the PHS Policy on 
Humane Care and Use of Laboratory Animals, the NIH Guide for the Care and Use of Laboratory Animals, and the Animal Welfare Act (7 
U.S.C. et seq.); the animal use protocol was approved by the Institutional Animal Care and Use Committee (IACUC) of Vanderbilt University. 
Presented at the meeting of the American Laryngological Association, Palm Desert, California, May 12-13, 2001. 


CORRESPONDENCE — C. Gaelyn Garrett, MD, Dept of Otolaryngology, Vanderbilt Voice Center, 1500 21st Ave S, Suite 2700, Nashville, TN 


37212. 
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We investigated wound healing of vocal fold mu- 
cosa following microflap excision using a previously 
described canine model, comparing cold steel, a CW 
CO? laser, and a prototype microsecond-pulsed CO2 
laser.” Both histologic and functional parameters 
were examined. 


MATERIALS AND METHODS 


All procedures and actions were approved and 
monitored by the Institutional Animal Care and Use 
Committee. Thirteen mongrel dogs underwent sus- 
pension microlaryngoscopy and bilateral microflap 
excision of vocal fold mucosa by 1 of 3 techniques: 
1) cold steel, 2) CW CO? laser in a “pulsed mode,” 
and 3) 7.5-us pulsed CO2 laser. Each animal had 2 
different techniques used, | for each vocal fold. The 
animals were divided into 2 groups according to time 
of sacrifice following surgery: at 2 weeks or at 6 
weeks. Within each group, they were further subdi- 
vided into 2 groups according to the techniques used. 
In all dogs, the pulsed CO2 laser was used on | vo- 
cal fold, chosen randomly. The other vocal fold was 
treated either with cold steel or the CW CO? laser. 
Three dogs in the 6-week group had vocal fold exci- 
sions with the pulsed CO2 laser and cold steel. Four 
dogs in the 6-week group had vocal fold excisions 
with the pulsed CO2 laser and the CW CO? laser. 
Three dogs in the 2-week group had vocal fold exci- 
sions with the pulsed CO2 laser and cold steel. One 
animal was excluded because of inability to perform 
laryngeal videostroboscopy (LVS) for functional eval- 
uation. Three dogs in the 2-week group had vocal 
fold excisions with the pulsed CO2 laser and the CW 
CO? laser. Each of the 6-week dogs were evaluated 
with LVS acutely, at 2 weeks and at 6 weeks, just 
before sacrifice. The larynges were harvested for his- 
tologic examination after painless sacrifice of the ani- 
mal. 


For each procedure, the animal was placed supine 
on an operating table and anesthetized with intramus- 
cular agents (telazol 4 mg/kg, xylazine hydrochlo- 
ride 0.275 mg/kg) and allowed to breathe spontane- 
ously. Intraoperative monitoring consisted of an elec- 
trocardiogram and measurements of the depth and 
quantity of spontaneous respiration and the level of 
pain threshold. Each surviving animal was given pre- 
operative and postoperative antibiotics (penicillin G 
procaine and gentamicin sulfate). Postoperative pain 
was managed with flunixin meglumine. 


Suspension microlaryngoscopy was performed by 
inserting a modified Ossoff-Karlen laryngoscope 
(Pilling, Fort Washington, Pennsylvania). Laryngeal 
videostroboscopy was performed with percutaneous 
delivery of air through a 14-gauge needle inserted at 


approximately the fourth tracheal interspace and ex- 
ternal manual pressure on the posterior larynx to 
achieve adequate adduction as recently described.’ 
The amount of airflow used was the minimum flow 
needed to trigger vocal fold vibration and phonation. 


Direct microlaryngoscopy with microflap excision 
of vocal fold mucosa was performed during the ini- 
tial procedure after completion of LVS. With the ani- 
mal still anesthetized, an operating microscope (mod- 
el OPMI-DFC, Carl Zeiss, Inc, Thornwood, Pennsyl- 
vania) with a 400-mm lens was positioned for a mag- 
nified view of the larynx. A microflap incision was 
made near the medial edge along the middle third of 
each membranous vocal fold. The incision was made 
by using | of the 3 techniques listed previously. The 
sides were chosen randomly by a coin toss. The CW 
CO2 laser (model 1060, Sharplan Lasers, Inc, Allen- 
dale, New Jersey) was used in the repeat pulse mode 
at settings of 5 W, 0.1-second on-time, and a frequen- 
cy of | Hz. The beam was delivered through an articu- 
lated arm and focused to a 250-um spot size with a 
microspot micromanipulator (model 710, Sharplan 
Lasers, Inc) attached to the operating microscope 
with a Gaussian (TEM00 [transverse electromagnet- 
ic mode]) beam profile. The pulsed laser, a sealed 
transverse electric atmosphere CO? laser (Argus Pho- 
tonics Group, Jupiter, Florida), used a 7.5-us pulse 
width and was delivered at a 3-Hz repetition rate. 
This laser delivers a 35- to 55-mJ pulse energy with 
a Gaussian (TEMO00) beam profile and a focused 250- 
um spot size using the same articulated arm and mi- 
crospot micromanipulator as that used for the CW 
CO? laser. 


After the initial incision, a surgical plane was de- 
veloped at the depth of the incision with microlaryn- 
geal elevators within the more superficial aspect of 
the lamina propria. This layer is comparable to the 
surgical plane seen with microflap procedures within 
the superficial layer of the lamina propria in the hu- 
man vocal fold. An ellipse of the elevated mucosal 
cover including the epithelium and that portion of 
the superficial lamina propria superficial to the micro- 
flap plane was excised by the same technique used 
for the initial incision. All surgical procedures were 
performed by the first author (Fig 1). 


Two additional dogs were treated and sacrificed 
acutely. In these 2 animals, the goal was to measure 
the depth of the incision and the amount of thermal 
injury produced by the CW CO? laser and the pulsed 
CO2 laser. No LVS was performed. The procedure 
for microsuspension laryngoscopy differed from the 
other groups in that only the initial single incision 
was made in each vocal fold. One pass only of the 
laser was performed. 
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Fig 1. Sketch shows approximate area and location of 
mucosa removed by microflap elevation technique from 
each vocal fold. 


At the time of sacrifice, each animal was given a 
lethal dose of barbiturate while still under general 
anesthesia. The larynges were harvested and kept in 
10% formalin for several days before sectioning. Co- 
ronal sections at the midfold level were processed in 
the medical center’s division of histopathology. 


Histology. The specimens were decalcified, sec- 
tioned with a rotary microtome (4 to 5 um), and 
stained by standard techniques. Hematoxylin and eo- 
sin (H & E) staining was used in the analysis of the 
inflammatory infiltrate and to identify fibroblasts. 
Masson’s trichrome stain was used in the analysis of 
fibrin deposition in the 2- and 6-week animals and 
of thermal damage in the acute animals. Movat’s 
modification of the pentachrome stain was used to 
evaluate the layers of the lamina propria and its con- 
nective tissue components in the 6-week animals. 
Qualitative observations were made of the sections 
to note the density and quality of wound healing com- 
ponents. The H & E sections were examined for in- 
flammatory infiltrate. The Masson’s sections were 
examined for the amount of coagulative necrosis in 
the acute animals and for the relative concentrations 
of fibroblasts and collagen in the lamina propria of 
the 2- and 6-week animals. The Masson’s and Mo- 
vat’s sections were examined under polarized light 
to assess birefringence for uniformity of the collagen 
fibers, indicative of the quality of wound healing. 
All histologic examinations were blinded as to which 
technique was used on each section. 


Laryngeal Videostroboscopy Analysis. All LVS 
samples were condensed with a video editor into a 
single tape of approximately 45-second samples. 


Each 6-week animal had 3 LVS samples (preopera- 
tive, 2 weeks, and 6 weeks). The animals were ran- 
domly assigned a number so that the evaluators would 
be blinded to the treated side. For each animal, the 3 
samples were placed in random order, but grouped 
together on the tape so that the evaluators were aware 
the samples were from the same animal, but were 
not aware of the time period of the examination. The 
3 evaluators were all trained in the acquisition and 
interpretation of human and canine LVS. Vocal fold 
anatomy and LVS parameters were described. Three 
scores were given for each sample: | each for the 
appearance of the mucosal surface (rated | to 5, with 
5 being normal), mucosal wave pattern (also rated | 
to 5), and vertical phase difference (VPD; rated 1 if 
absent and 2 if present). The code was broken and a 
single score was calculated for each animal at each 
time period by summing the 3 scores at each time 
period. The data were inputted into a statistics com- 
puter program (Stat View, Abacus Concepts, Berke- 
ley, California) and analyzed by Student’s /-test with 
a paired comparison. A p value of less than .05 was 
considered statistically significant. 


RESULTS 


All animals survived the designated period of the 
study. No complications were encountered. As men- 
tioned above, | animal was excluded from the study 
because of inability to perform LVS. 


Qualitative Surgical Observations. Previous stud- 
ies have affirmed the use of the dog as an animal 
model for functional and histologic analysis follow- 
ing vocal fold microflap surgery. There were ob- 
served differences during the operation in the perfor- 
mance of the microflap incision with the 3 techniques. 
The scalpel incisions were similar to those performed 
clinically in humans and resulted in a surgical plane 
superficially within the lamina propria. The incisions 
made with the CW CO2 laser visibly caused char at 
the edges of the incision, as well as at the depth of 
the mucosal excision within the lamina propria. The 
incisions and excisions made with the pulsed CO2 
laser did not cause char formation, but had less he- 
mostasis than those made with the CW CO? laser 
(Fig 2). The depths of the laser incisions were sub- 
jectively at depths similar to those of the scalpel in- 
cisions, as observed by the surgeon. 


Histology. The vocal folds from the acute animals 
revealed differences in depth of the ablation crater 
and extent of thermal injury between incisions made 
with the CW CO? laser and those made with the pulsed 
CO2 laser. The maximum depths of the ablation cra- 
ters in the 2 vocal folds treated with the CW CO2 
were 1.9 mm and 950 um. The maximum depths of 
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Fig 2. Intraoperative photograph of vocal fold treated 
with pulsed carbon dioxide (CO2) laser (left) and con- 
tinuous wave (CW) CO? laser (right), Note visible char 
in right vocal fold and less hemostasis in left vocal fold. 


the ablation craters in the 2 vocal folds treated with 
the pulsed CO2 laser were 950 um and 190 um. The 
amount of thermal injury to the tissues surrounding 
the ablation craters also differed between the 2 la- 
sers. Those vocal folds treated with the CW CO? la- 
ser had thermal injury extending between 190 and 
285 um beyond the ablation crater, while those vo- 
cal folds treated with the pulsed CO2 laser had less 
than 50 um of thermal injury to none (Fig 3). 

All vocal folds in the surviving animals had re-epi- 
thelialized by 2 weeks after surgery. In both the 2- 
week and 6-week groups, no significant differences 
were seen in inflammatory infiltrate among the 3 tech- 
niques used. Examination of connective tissue com- 
ponents was initially done with normal optical micros- 
copy. No obvious qualitative differences were noted 
among the 3 groups at 2 or 6 weeks. However, exam- 


ination of the specimens with polarizing filters did 
reveal qualitative differences. All but | of the vocal 
folds treated with cold steel excisions in the 2-week 
group had a normal distribution of mature collagen 
in the superficial lamina propria, as evidenced by 
the birefringence of the fibers. The vocal folds treated 
with both types of CO2 laser in the 2-week group 
had areas of reduced birefringence in the superficial 
lamina propria, measuring between 475 um and 950 
um deep to the epithelium in the CW CO2 group and 
between 190 um and 475 um in the pulsed CO2 
group. The 3 vocal folds in the 6-week group treated 
with cold steel had normal collagen distribution and 
birefringence. The 4 vocal folds treated with the CW 
CO? laser had areas of reduced birefringence extend- 
ing up to 950 um deep to the epithelium. None of the 
vocal folds in the CW CO2 group had normal colla- 
gen distribution and birefringence. Two of the 4 vo- 
cal folds in this group had areas of granulation tis- 
sue in the superficial lamina propria indicative of de- 
layed healing. Five of the 7 vocal folds treated with 
the pulsed CO2 laser had normal collagen distribu- 
tion and birefringence at 6 weeks. The other 2 had 
areas of granulation tissue within the superficial lam- 
ina propria (Fig 4). 


Laryngeal Videostroboscopy. Comparisons of the 
LVS data were made at the time of the initial proce- 
dure, at 2 weeks, and at 6 weeks. There were no sig- 
nificant differences between the vocal folds of each 
animal at the preoperative evaluation. At the 2-week 
evaluation, no statistically significant differences 
were observed between the CW CO? laser excisions 
and the pulsed CO2 laser excisions (p = .32) or be- 
tween the cold steel excisions and the pulsed CO2 
laser excisions (p =.7). At the 6-week evaluation, no 
differences were observed between the cold steel and 





Fig 3. Ablation craters in vocal folds following single pass of laser (Masson’s, original x80). A) CW CO? laser. Ablation crater 
is approximately 950 um deep, with thermal injury extending another 190 um beyond (arrows). B) Pulsed CO2 laser. Ablation 
crater is approximately 190 um deep, with no measurable deeper thermal injury. 


Garrett & Reinisch, New-Generation Pulsed Carbon Dioxide Laser 175 





Fig 4. Epithelium and lamina propria of vocal fold in area of healed mucosal excision (Masson’s, original x80 with polarizing 
filters). A) CW CO? laser. Shows area of non-birefringence extending approximately 950 um deep to epithelium. B) Pulsed 
CO? laser. Shows normal distribution of birefringent collagen deep to epithelium. 


pulsed CO2 laser excisions (p = .53). Comparison 
between the CW CO? laser excisions and the pulsed 
CO2 laser excisions at 6 weeks, however, revealed 
improved LVS parameters in the vocal folds treated 
with the pulsed CO2 laser (p < .02). 


DISCUSSION 


The CO2 laser is a versatile tool in many surgical 
fields. In otolaryngology, it is used in both ablative 
and incisive procedures; it offers hemostasis and sur- 
gical precision in procedures in which a convention- 
al scalpel incision would be restricted. The CO2 la- 
ser emits light in the invisible infrared region of the 
spectrum at 10.6 um, at which it is strongly absorbed 
by water in the target tissue. The laser energy causes 
the water to vaporize, resulting in the thermal abla- 
tion of tissue. A side effect is that the tissue adjacent 
to the ablation crater is also affected by the thermal 
energy, which may result in undesirable scar forma- 
tion in previously healthy tissue. In some anatomic 
areas, such as the oral cavity, this excess scar forma- 
tion in the lamina propria or on previously healthy 
epithelium of the vocal folds may have unfortunate 
and unintended effects on voice outcome. For this 
reason, use of the CO2 laser and other lasers with 
even deeper thermal profiles, such as the neodymium: 
yttrium-aluminum-garnet and KTP lasers, is limited 
by most microlaryngeal surgeons to neoplastic epi- 
thelial lesions and stenotic lesions. 


Two time durations are important in the under- 
standing of laser-tissue interactions and subsequent 
wound healing. Thermal recovery time is the amount 
of time it takes tissue to return to its original temper- 
ature after an ablation pulse has been delivered. The 
thermal recovery time serves as a limit to how fre- 


quently the laser pulses can be repeated without heat 
buildup in the tissue and an increase in thermal in 


jury. This time period is usually between 200 and 


300 ms in skin for a 5-W CO? laser. Therefore, repe 
tition rates greater than approximately 5 Hz can in- 
crease thermal damage. The second important time 
duration is the thermal relaxation time. It is defined 
as the time needed for the heat from an ablation cra- 
ter to diffuse thermally to lateral tissue, potentially 
causing injury to normal tissue. Thermal relaxation 
time is more variable than thermal recovery time and 
depends on the distance and the temperature differ- 
ences between the source and the target tissue. It has 
been estimated to be between 7 and 700 us. Thus, a 
laser with a pulse length shorter than this should cause 
less thermal damage, because the laser energy is de- 
livered quickly enough to avoid lateral diffusion. The 
pulsed laser used in this study has a favorable pulse 
length of 7.5 us, whereas most CW CO? lasers in clin 
ical use today have beam properties in the “pulsed” 
mode outside those parameters considered favorable 
for minimal thermal injury. 

















The prototype pulsed CO? laser investigated in this 
study has been shown by histologic examination to 
cause less thermal injury in dermal and muc 
sue than a CW CO? laser. Tensiometry measurements 
were also used in previous studies’® 
strength, confirming the improved functional outcome 
with the pulsed laser. Functional outcomes in micro 
laryngeal surgery are not as dependent on wound ten- 
sion, but are more dependent upon the vibratory char- 
acteristics of the vocal fold cover and body, specifi- 
cally, the epithelium and the 3 layers of the lamina 
propria overlying the vocalis muscle. Laryngeal 
videostroboscopy provides us with a tool to evaluate 
vocal fold vibration. The canine model used has been 


osal tis- 


to assess tissue 
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shown to be an adequate model for these studies, ac- 
cepting the histologic differences between the canine 
vocal fold and the human vocal fold.!° In this present 
study, the histologic results showed qualitatively im- 
proved wound healing in the vocal folds treated with 
the pulsed CO2 laser as compared to the CW CO2 
laser in the “pulsed” mode, although these differences 
were not as dramatic as those seen in dermis and oral 
mucosa. However, the functional results conclusively 
showed better LVS parameters in the vocal folds 
treated with the pulsed CO2 laser as compared with 
the CW CO2 laser. This result, coupled with results 
seen in other tissues with use of these same lasers, 
confirms the advantages of the pulsed CO2 laser over 
the CW CO? laser with regard to wound healing. 


The results of this study indicate that the microsec- 
ond-pulsed CO2 laser is an alternative method for 
laryngeal surgery. The surgeon should be aware, how- 
ever, that some additional tissue damage and a slight 
delay in wound healing may be the compromise for 
gaining a hands-free, precise means of performing a 
laryngoscopic procedure. Also, the incision width 
produced by either CO2 laser is larger than that seen 
with a microlaryngeal sickle knife. 


We conclude that the pulsed CO2 laser investigated 


in this study has favorable laser beam parameters that 
result in better wound healing for those lesions for 
which laser techniques are advantageous. The tissue 
effect seen during the operation does not appear to 
extend to deeper, nonablated tissue. Therefore, the 
subsequent wound healing is more predictable. Non- 
neoplastic lesions involving the superficial layer of 
the lamina propria (polyps, cysts, nodules) should 
be addressed with mucosal sparing techniques and 
should not be simply ablated with the laser. Instead, 
the pulsed CO2 laser could be used to perform the 
initial epithelial incision, as the depth of penetration 
is comparable to the plane of dissection in cold steel 
microflap techniques. Development of smaller laser 
spot sizes should add to the precision by minimizing 
the width of the ablation crater. One observed disad- 
vantage with the pulsed laser is the lack of the coagu- 
lation of the microcirculation seen with the CW la- 
ser. Although this was not a problem in the opera- 
tions performed in this study, it could affect the treat- 
ment of more vascular lesions, such as respiratory 
papillomatosis. Continued research in laser-tissue 
interactions, particularly with microsecond-pulsed 
lasers, should result in further innovations to clini- 
cal lasers that will benefit the laryngologist’s prac- 
tice and surgical outcomes. 
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Functional changes in voice production, including pitch elevation, may help male-to-female transsexuals assume a new gender 
identity. To date, there has been a paucity of objective data on the effectiveness of pitch-raising methods. Acoustic data were gathered 
with regard to preoperative and postoperative changes in pitch, pitch range, and perturbation in 20 patients after cricothyroid 
approximation. Subjective data were gathered by means of a mailed questionnaire. The mean follow-up time was 22 months. The 
speaking fundamental frequency was raised by half an octave without any significant changes in perturbation. The lower and upper 
limits of pitch range both increased by an average of 4 semitones. There was some decline over time of the lower pitch range toward 
preoperative levels, but the upper pitch range remained elevated. The majority of the patients were satisfied with the results of 


surgery and felt their voices to be more feminine. 


KEY WORDS — cricothyroid approximation, phonosurgery, transsexualism, vocal pitch. 


INTRODUCTION 


Transsexuals are individuals who experience a cri- 
sis in their perception of their own gender identity. 
They remain convinced of a discrepancy between 
their psychological makeup and their anatomic make- 
up and so experience a state of gender dysphoria.! 
As defined by the DSM-IV,? gender identity disorder 
is a condition characterized by a strong and persistent 
cross-gender identification, persistent discomfort in 
the current gender role, the absence of a physical in- 
tersex condition (eg, androgen insensitivity syn- 
drome), and the presence of clinically significant dis- 
tress in social, occupational, or other important areas 
of functioning. The incidence of male-to-female 
transsexualism is believed to be 1 in 37,000 per- 
sons.34 Since the 1970s, treatment of this condition 
has involved a combination of hormone therapy and 
sex reassignment surgery>® with an increasing em- 
phasis upon a multidisciplinary model of treatment.” 
Typical surgical procedures to alter the appearance 
of male-to-female transsexuals include mamma- 
plasty, abdominoplasty, rhinoplasty, and laryngeal 
cartilage shaving. After surgery, an increased degree 
of satisfaction with quality of life has been reported, 
as well as a decrease in “antisocial and self-destruc- 
tive behavior” such as suicide attempts, criminal ac- 
tivity, and drug use.8 


One of the challenges for male-to-female transsex- 
uals has been modification of the male voice. Although 


testosterone therapy does influence the quality of the 
voice in female-to-male transsexuals, male-to-female 
transsexuals do not derive a similar benefit from es- 
trogen therapy.?:!0 Consequently, a number of differ- 
ent approaches have been used in order to provide 
vocal rehabilitation for this population. These can be 
separated into nonsurgical (ie, therapeutic) and sur- 
gical methods. 


Therapeutic alteration of male voice and speech 
patterns has received a significant amount of attention 
in the literature. As Oates and Dacakis" note in their 
review, despite the relative importance of semantics, 
pragmatics, intonation, intensity, and resonance, the 
overriding concern has been with altering vocal pitch. 
Kalra!2 reported an increase in fundamental fre- 
quency (F0) from 168 Hz to 200 Hz after 4 months 
of weekly voice therapy with a male-to-female trans- 
sexual client. Froeschel’s chewing method was used 
to effect this change without encouraging vocally abu- 
sive habits. After 7 sessions of voice therapy, an in- 
crease in pitch from 145 Hz to 165 Hz was described 
by Bralley et al!3 in a 49-year-old male-to-female 
transsexual. Naive listeners judged the voice as being 
more feminine, although it could still be differentiated. 
from a natural female voice. 


However, the importance of parameters other than 
vocal pitch has been described. Wolfe et al!4 found 
that voices only a semitone apart (145 Hz versus 155 
Hz) would be consistently categorized as either male 
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or female on the basis of formant frequencies and 
upward or downward inflections. In an attempt to 
apply these observations therapeutically, Mount and 
Salmon!> described results for a program of pitch 
elevation and resonance alteration in a 63-year-old 
male-to-female transsexual. This individual demon- 
strated an increase in F0 from 110 Hz to 205 Hz for 3 
key vowels, and maintained these gains for 5 years 
after treatment. Information regarding the Fo of the 
conversational speaking voice was not given. 


A variety of surgical procedures have been de- 
scribed that elevate pitch by altering the mass, ten- 
sion, or overall length of the vocal folds. Surgical 
methods to increase vocal fold tension include ante- 
rior commissure advancement!®-!7 and cricothyroid 
approximation (CTA).!% Alteration of vocal fold mass 
is accomplished by scarification of the vocal folds,!'9 


steroid injection of the vocal folds, or reduction of 


vocal fold mass by CO2 laser vaporization. Donald? 
has described a technique in which both the length 
and mass of the vocal folds are surgically altered by 
removing an anterior portion of the vocal folds and 
thyroid cartilage and creating an anterior web. 


Advantages and disadvantages have been de- 
scribed anecdotally for each of these procedures, and 
results have been reported for small numbers of pa- 
tients. To date, however, there has been little long- 
term follow-up to document their relative efficacy. 
One exception is the study by Gross?! of 9 patients 
who underwent a modified form of Donald’s proce- 
dure. He reported a 9.2-semitone increase in FO for 
these patients 3 years after surgery. Six patients who 
underwent a similar surgical procedure were followed 
for various lengths of time by Kunachak et al.?? Of 
these, 4 patients maintained an FO of over 300 Hz at 
a period of between 2 and 6 years after operation. 


Unfortunately, even limited data such as these do 
not exist regarding the long-term efficacy of the ear- 
liest and perhaps most popular of these types of sur- 
geries: CTA. Isshiki et al? described their experience 
using this procedure in human subjects first in 1976, 
and again, with a larger series of patients, in 1983.74 
In the earlier study, both individuals were women 
with androphonia. The 11 patients who underwent 
CTA in the 1983 study included | male-to-female 
transsexual, 6 women with androphonia resulting 
from anabolic steroid use, | woman with adrenocorti- 
cal syndrome, 2 women with a unilateral cricothyroid 
muscle paralysis (who underwent CTA of the right 
side only), and | woman with congenital sulcus vo- 
calis (who underwent CTA with expansion of the left 
thyroid ala). The male-to-female transsexual experi- 
enced an increase in pitch from 110 Hz to 164 Hz, 
while the other 10 subjects experienced an average 


increase of 48 Hz, from 165 Hz to 213 Hz. These 
results are, however, difficult to generalize, owing 
to the absence of information regarding length of fol- 
low-up, differences in the subject populations, and 
the multiple methods used to increase pitch. Accord- 
ing to a personal communication cited by Koufman 
and Isaacson,*> the excellent short-term results 
achieved by these patients were not maintained, but 
no additional information is provided. 

There is also a paucity of satisfactory documenta- 
tion regarding long-term complications. On a theo- 
retical basis, Isshiki et al have indicated that “late 
loosening of the knot is a problem worthy of due con- 
sideration.” !*(P456) Although | female patient treated 
by a combination of CTA and intracordal injection 
of a corticosteroid (triamcinolone) was described as 
being satisfied with her voice outcome 2 years after 
surgery, |sshiki reported that “the vocal folds seem- 
ingly became a little lax 1 year after the opera- 
tion,”6(P789) and attributed the positive outcome to a 
combination of voice training and the presence of 
the steroid. 

More objective documentation of postoperative 
outcomes is contained in a recent article by Pickuth 
et al?” in which computed tomography was used to 
compare the preoperative and postoperative crico- 
thyroid distances in 29 transsexual patients. After at 
least | year’s follow-up, their patients had an average 
increase of pitch from 118 Hz to 226 Hz; and they 
calculated that on average, pitch increased approxi- 
mately 18 Hz for every millimeter of approximation. 
They found that those who had only a slight reduction 
of cricothyroid distance had a tendency to postop- 
erative reversion of vocal pitch, and attributed this 
to late loosening of the sutures. 


The purpose of this study was to review the effect 
of CTA as performed at our institution on male-to- 
female transsexual patients, with regard to both ob- 
jective and subjective variables. Goals included the 
objective measurement of changes in pitch and pitch 
range, and the assessment of subjective perceptions 
related to vocal clarity, pitch, femininity, volume, 
range, and overall satisfaction. Information regarding 
difficulties experienced after operation was also 
sought. 

MATERIALS AND METHODS 


A retrospective chart review was undertaken of 
all male-to-female transsexuals who had undergone 
primary CTA by the primary authors (J.I.C., T.R.M.) 
between June 1995 and June 2000. Of these 73 indi- 
viduals, 5 were excluded because the acoustic data 
from their preoperative voice evaluation had not been 
recorded. Of the remaining 68, there were 31 patients 
(46%) who had been evaluated within | year after 
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Fig 1. Planned incision. Strap mus- 
cles have been separated in mid- 
line to completely expose cricoid 
and thyroid cartilages. 


their surgery. Nine patients had been reevaluated 12 
months or more after their surgery. In order to obtain 
more long-term follow-up data, we attempted to con- 
tact the 35 individuals who had undergone CTA and 
were currently residing on the West Coast. These in- 
dividuals were contacted by letter and by telephone 
by means of our most recent contact information. 
When both of these methods failed, we attempted to 
locate the individual using directory inquiries for the 
last known city of residence. Given the sensitive 
nature of assuming a new identity after gender reas- 
signment surgery, requests for no further follow-up 
were respected. By these means, l-year follow-up 
acoustic data were obtained for 20 of our original 68 
patients (29%). Follow-up acoustic data were taken 
in the manner described below. 


The standard preoperative protocol included fiber- 
optic laryngoscopy and acoustic voice measurement 
across a variety of speech tasks. Fundamental fre- 
quency was measured on a sustained /a/ (“ah”) and 
during the reading of a standard text (the Rainbow 
Passage). Pitch perturbation or jitter was recorded 
across 2 trials of the sustained vowel task. The fre- 
quency of the sustained vowel with the lower pertur- 
bation value was always used. Pitch range was ascer- 
tained by recording the highest and lowest frequen- 
cies reached across 2 trials of pitch glides. Videostro- 
boscopy was not part of the standard preoperative 
evaluation, as this technology was not available to 
the authors in 1995. All acoustic measurements were 
recorded with a 6087 DS model VisiPitch device 
(Kay Elemetrics, Lincoln Park, New Jersey). 


In addition to objective measures, we also gathered 
information regarding patient satisfaction. To this 
end, a questionnaire was mailed to all 73 individuals. 
The questionnaire addressed vocal clarity, loudness, 
femininity, range, physical difficulties, and overall 
satisfaction. Questions related to the use of voice 
therapy and physical difficulties were also included. 
The survey format included open-ended questions, 
5-point Likert-type scales, and binary choices (“yes” 
or “no”). Of the questionnaires mailed, 36 (49%) 
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were returned completed and 11 (15%) were unde- 
liverable. 


Surgical Technique. The technique used involves 
minor modifications of the methods described by Is- 
shiki et al24 and Sataloff et al,?8 which are noted be- 
low. The procedure is performed under local anesthe- 
sia with monitored anesthesia care. One percent lido- 
caine hydrochloride with 1:100,000 epinephrine bi- 
tartrate is injected into the planned surgical site. The 
neck is prepared and draped in a sterile manner. A 
transverse incision is made in an existing skin crease, 
if possible at the level of the vertical midpoint of the 
thyroid ala. Subplatysmal skin flaps are elevated, the 
strap muscles are separated in the midline, and the 
anterior aspects of the thyroid and cricoid cartilages 
are completely exposed (Fig 1). Subperichondrial ele- 
vation is performed on either side of the midline (to 
avoid detaching the anterior commissure tendon) on 
the inner aspect of the lower | cm of the thyroid carti- 
lage. Similarly, a subperichondrial dissection of the 
inner aspect of the anterior cricoid cartilage is accom- 
plished (Fig 2). Four holes, 2 on either side of the 
midline, are created with a drill along the lower edge 
of the thyroid cartilage in the area that has been dis- 
sected. These holes and the 4 sutures of 0 Ethibond 
that are sequentially placed circumferentially around 
the anterior 120° of the cricoid cartilage and through 
the holes in the thyroid cartilage (with care taken to 
stay in a subperichondrial plane) are approximately 
0.5 cm and | cm from midline. Each vertically ori- 
ented suture includes a small amount of the tissue 
below the cricoid arch, the anterior cricoid arch, the 
cricothyroid membrane, and the lower thyroid carti- 
lage. The inclusion of tissue below the cricoid arch 
cushions the force it applies to the cricoid arch and 
helps prevent fracture or transection of the arch by 
the suture. We have not found that bolsters are neces- 
sary? to prevent this inadvertent transection of the 
cartilages by the suture, provided adequate soft tissue 
is included inferiorly and the holes in the thyroid car- 
tilage are placed at least 5 mm from its inferior mar- 
gin. The sutures are then tightened and the patient is 
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asked to vocalize to test for appropriate pitch eleva- 
tion; usually, the end point is just beyond the point 
of maximal pitch elevation to a slightly strained whis- 
per. The sutures are then tied (Fig 3). Hemostasis is 
assured, and the wound is closed in multiple layers. 
No attempt is made to abrade the opposing cartilagi- 
nous surfaces to encourage their fusion.2® Instead, 
we rely upon the 4 sutures to maintain the relative 
positions of the cricoid and thyroid cartilages. In a 
small number of cases reexplored for the purposes 
of thyroid cartilage shaving, we have found a thick 
fibrous scar band surrounding these sutures and bridg- 
ing the inferior thyroid and superior cricoid cartilage 
margins. This probably serves to maintain their posi- 
tion over time. In most cases, adequate pitch elevation 
occurs with incomplete obliteration of the cricothy- 
roid space; in other cases, complete obliteration of 
the space is needed. Rarely, perhaps because of in- 
dividual anatomic variation, subluxation of the upper 
portion of the cricoid arch below the lower border of 





Fig 2. Subperichondrial elevation on inner aspect 
of lower | cm of thyroid cartilage is performed on 
either side of midline (but sparing 5 mm at midline 
to avoid detaching anterior commissure tendon). 
Similarly, subperichondrial dissection of inner as- 
pect of anterior cricoid cartilage is carried out to 
completely free it from underlying laryngotracheal 
membrane. 


the thyroid cartilage is needed. The intraoperative vo- 
cal pitch change, rather than anatomic landmarks, 
determines the degree of approximation. 


Although voice rest is not required from a healing 
standpoint, the patients are placed on voice rest fora 
period of 10 days. This rest is designed to allow for 
the initial stretching of the laryngeal soft tissues (and 
perhaps the sutures), which in our experience invari- 
ably occurs in the first 7 to 10 postoperative days. 
This stretching lowers the pitch from the point of 
strained whisper to a more desirable elevated pitch. 
In addition, it allows the initial swelling and spasm 
in the overlying laryngeal strap muscles created by 
the operation to subside, so that at the point of speech 
resumption, laryngeal elevation is again possible. In 
this way, when the patient first resumes speech, her 
postoperative voice is likely to be closer to that which 
is desired, and the patient is likely to be encouraged 
by her early experiences. 





Fig 3. A) Four sutures of 0 Ethibond are sequentially placed in circumference around anterior aspect of cricoid 
cartilage and through holes in thyroid cartilage. Each suture includes small amount of soft tissue below cricoid arch, 
anterior cricoid arch, cricothyroid membrane, and lower thyroid cartilage. B) Sutures are then tightened, and patient 


is asked to vocalize. 
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SURGICAL RESULTS FOR 20 PATIENTS 














Ageat Durationof Habitual Fo 
Surgery Follow-up for “ah” (Hz) 
Case (y) (mo) Preop Postop 
1 44 16 239 206 
2 42 15 144 259 
3 59 35 190 225 
4 39 14 142 191 
5 33 25 — 120. 184 | 
6 45 19 176 -> -203 
7 51 24 138 205 
8 50 16 108 210 
9 34 12 107 . 131 
10 58 16 138 - 179 
11 63 16 99 189 . 
12 46 29 110 177 `> 
13 47 16 137 249 
14 44 24 134 206 
15 49 17 200 195 
16 40 23 137 216 
17 45 24 168 212 
18 48 19 163 207 
19 39 30 112 133 
20 53 52 129 272 
Mean 46.5 22.1 144.5 202.4 
SD 7.9 9,3 35.7 34.7 
Range 33-63 12-52 99-239 131-272 


Fo — fundamental frequency. 


RESULTS 

Objective Measures. Follow-up acoustic measure- 
ments were available for 20 of the 68 patients oper- 
ated on during the study period and form the basis of 
this report. They are presented in the Table and in 
Fig 4. These measurements were taken between 12 
and 52 months after operation (mean, 22 months). 
The individuals ranged in age from 33 to 63 years at 
the time of surgery (mean, 46 years). 
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Habitual Fo Pitch Range Perturbation, ie, 
for Reading (Hz) (semitones) Jitter (%) 
Preop Postop Preop Postop Preop Postop 

156 181 27 34 0.6 0.5 
141 210 22 26 0.6 0.2 
158 189 26 25 0.2 0.9 
125 202 22 19 0.2 0.4 
106 198 29 26 0.5 0.8 
125 178 26 27 0.6 1.4 
148 194 25 21 1.1 0.8 
114 183 46 30 0.7 
111 124 13 21 0.7 1.0 
156 167 26 44 0.6 L1 
112 218 27 25 0.4 
177 28 1.5 
128 227 28 11 0.7 0.2 
es 167. 33 22 0.5 0.3 
163 176 31 38 0.2 0.2 
164 227 36 0.7 
129 185 35 1.4 0.7 
136 190 32 0.5 0.8 
100 156 31 36 0.4 
160 30 0.3 1.3 
133.6 185.4 27.8 28.2 0.6 0.7 
21.1 25.0 6.8 7.8 0.3 0.4 
100-164 124-227 13-46 11-44 0.2-1.4 0.2-1.5 


With regard to habitual pitch, before surgery these 
individuals demonstrated an Fo of 145 Hz (D3) and ` 
134 Hz (C3), respectively, on sustained vowel and 
reading tasks. At follow-up, these tasks were mea- 
sured at 202 Hz (G3) and 185 Hz (F#3). The average 
increases in pitch, therefore, were 5 and 6 semitones 
across these two tasks, or approximately half an oc- 
tave. Normative data for men in the 40- to 49-year 
age range suggest an Fo of 107 Hz during reading.290 
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Data for women in the same age range indicate an 
average of 189 Hz during reading tasks.3! Thus, the 
habitual pitch for the individuals in our study was 
increased 4 semitones above the preoperative average 
and was age- and gender-appropriate at follow-up. 


Measures of perturbation, the irregularity of vocal 
fold vibration, were not recorded for all individuals. 
However, for the 14 individuals with both preopera- 
tive and postoperative measures of frequency pertur- 
bation (jitter), the preoperative average of 0.6% in- 
creased to 0.7% after operation. Perturbation de- 
creased for 6 of these subjects by as much as 0.7%, 
and increased for the remaining 9 by as much as 
1.0%. The individual with the greatest increase in 
perturbation (subject 20) was noted at follow-up to 
have significant reflux laryngitis, which may have 
contributed to her hoarseness. However, an average 
increase of 0.1% suggests little change in hoarseness. 


Comparing the individuals’ preoperative and fol- 
low-up pitch ranges, we found no significant differ- 
ence. Before surgery, the average pitch range was 
27.8 semitones (range, 13 to 46), and after surgery it 
was almost identical at 28.2 semitones (range, 19 to 
44). The lower limit of pitch range was measured at 
86 Hz (F2) before operation, and this increased to 
106 Hz (A2) at follow-up: an increase of 4 semitones. 
The upper limit likewise increased by 4 semitones, 
from 440 Hz (A4) to 532 Hz (C#5). Normative data 
for pitch range indicate that men between the ages 
of 40 and 65 years typically have a pitch range of 29 
semitones, and women between the ages of 35 and 
70 years have a range of 31 semitones.*7 


Some of the patients reported a decline in pitch 
from the time immediately after operation to the time 
of follow-up. We were able to substantiate this by 
examining the outcomes of 14 of these 20 patients in 





Over lyr Post-Op (M = 24 mos.) 


greater detail. Acoustic data were recorded within a 
year of surgery and were compared with their follow- 
up data (Fig 5). At a mean length of 6 months after 
operation, their Fo for speech had increased by 7 
semitones, from 136 Hz (C#3) to 209 Hz (G#3). By 
an average of 24 months after surgery, their FO was 
measured at 182 Hz (F#3), or a difference of 5 semi- 
tones from their preoperative pitch. Similar sustained 
but somewhat diminished increases were seen in the 
measurements for upper pitch range and average sus- 
tained Fo. However, although the lower limit of these 
patients’ pitch range showed an initial postoperative 
increase of 9 semitones, from 88 Hz (F2) to 145 Hz 
(D3), this was followed by a decline at follow-up to 
a level only | semitone above their preoperative mea- 
surement (90 Hz or F#2). Because this decline in the 
lower limit of the pitch range was greater than that 
seen in the upper pitch range, which remained rela- 
tively stable, these 14 patients demonstrate a steady 
increase in their dynamic range. Their dynamic range 
climbed from a preoperative one of 306 Hz to 315 
Hz at 6 months after operation and to 387 Hz at long- 
er-term follow-up: an overall increase of 2 semitones. 


Subjective Measures. A self-evaluation survey was 
mailed to each of the 73 patients identified by chart 
review. With regard to overall satisfaction with their 
current voice, 58% of the respondents indicated that 
they were satisfied, 33% were dissatisfied, and 8% 
were neutral about the results of surgery. The vocal 
quality was reported as being predominantly clear 
by 42% of subjects, “rough or hoarse” by 23%, and 
“fair” by 34%. The vocal pitch was rated as feminine 
by half of the respondents, masculine by one quarter, 
and neutral by the remainder. Asked how well the 
individual’s new voice fit with her current self-per- 
ception, 51% responded positively, 29% responded 
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negatively, and 20% felt it to be neutral. Regarding 
the femininity of their voices, the individuals were 
asked how often their voice was mistaken for a mas- 
culine one on the telephone. Forty-seven percent re- 
sponded “occasionally” or “sometimes,” 31% re- 
sponded “never,” and 22% responded “frequently” 
or “always.” Most of the respondents (81%) consid- 
ered their vocal loudness adequate, and slightly over 
half (51%) considered their vocal range adequate. 
Less than half of the respondents found that coughing 
(41%) or throat-clearing (45%) sounded more femi- 
nine after voice surgery. More than half (59%) found 
an improvement in the femininity of their laugh. 


With regard to voice therapy, the majority (64%) 
did not see a therapist to work on improving their 
postoperative voice. Of those that did, the average 
number of therapy sessions was 9.8. Comparing the 
acoustic data of those who received voice therapy and 
those who did not, we made 2 significant findings. 
Those who sought therapy had an upper pitch limit 
of 455 Hz (SD, 102 Hz) and a dynamic range of 350 
Hz (SD, 94 Hz); those who did not had an upper lim- 
it of 612 Hz (SD, 153 Hz) and a range of 518 Hz 
(SD, 130 Hz). Thus, individuals who received therapy 
demonstrated a reduced upper limit to their pitch range 
(t = —2.20, p = .05) and a reduced dynamic range (t = 
—2.71, p = .02) at their follow-up evaluation. Of 
course, these statistics cannot be interpreted as indi- 
cative of the efficacy of voice therapy, since subjects 
were not randomly assigned for treatment. Although 
all patients were counseled regarding the value of 
voice therapy, it seems plausible that those who chose 
to pursue it were those who had experienced less ex- 
treme pitch gains after surgery. 


The individuals were asked about physical prob- 
lems that they experienced in the postoperative pe- 
riod and those that they continued to experience cur- 
rently. Problems experienced immediately after oper- 
ation were (in order of frequency) trouble swallow- 
ing or choking on liquids (34%), sore throat or dis- 
comfort (29%), frequent throat-clearing (9%), hoarse- 
ness (6%), reduced vocal volume (6%), and vocal 
fatigue (3%). Problems experienced currently by re- 
spondents were trouble swallowing or choking on 
liquids (29%), sore throat or discomfort (14%), fre- 
quent throat-clearing (9%), reduced vocal volume 
(6%), vocal fatigue (6%), and hoarseness (3%). 


In general, there was no correlation between the 
individuals’ subjective evaluation of voice and the 
objective measurements performed. There were a few 
exceptions, however, most notably relating to the 
lower limit of the pitch range. The subjects whose 
lower limit of pitch range was low were significantly 
more dissatisfied with the femininity of vegetative 


sounds such as throat-clearing (t = 3.19, p = .011) 
and laughing (¢ = 3.38, p = .009), as well as with 
their vocal loudness (t = 2.73, p = .018). There were 
no significant differences on any of the objective 
acoustic measures in relation to subjective judgments 
of overall satisfaction, vocal pitch, vocal clarity, the 
suitability of the new voice, or the femininity of the 
voice. 


DISCUSSION © 


There is a paucity of data on the long-term effec- 
tiveness of pitch-raising surgeries for male-to-female 
transsexuals. The studies published thus far have been 
limited by a number of factors, including small sam- 
ple size, limited length of follow-up, the absence of 
a thorough voice evaluation protocol, and the group- 
ing of subjects with different diagnoses. Long-term 
outcomes have been forthcoming for other pitch-rais- 
ing surgeries,?!22 but have not been similarly col- 
lected for CTA. Anecdotal observations have cast 
doubt on the stability of voice outcomes after CTA. 


Our postsurgical investigation of 20 patients dem- 
onstrates that at a mean length of follow-up of 22 
months, the patients demonstrated a pitch increase 
of 6 semitones over their preoperative speech fre- 
quency. Pitch for a sustained tone increased from D3 
to G3, while speech went from C3 to F#3. Comparison 
with normative data indicated that the Fo of their post- 
operative speech was both age- and gender-appropri- 
ate. Pitch range remained the same from preoperative 
to postoperative measurement, at 28 semitones, but 
increases were noted for the upper and lower limits 
of the pitch range. 


Subjectively, at least half of our survey respondents 
were satisfied with their current voice, considered it 
to be feminine, and felt that it fit well with how they 
perceived themselves. Almost two thirds of our pa- 
tients were “occasionally” or “never” mistaken for a 
man on the telephone. Voice quality was described 
as clear by 42% of respondents, half considered their 
vocal range adequate, and over two thirds felt their 
vocal loudness to be adequate. The most common 
long-term problems were trouble swallowing (29%), 
sore throat or discomfort (14%), and frequent throat- 
clearing (9%). There was, however, no apparent cor- 
relation between the patients’ satisfaction and their 
subjective impression of the femininity of their voice 
and the objective measurements performed. This sug- 
gests that factors other than change in pitch influence 
this judgment. 


It is important to note that some of the patients 
reported a gradual decline in their postoperative pitch, 
and that our objective measurements confirm this. 
These results substantiate the argument that the initial 
pitch increase is somewhat reduced by the end of the 
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first year and are in agreement with a review con- 
ducted by Moerman et al,*? who concluded that the 
CTA procedure results in an initial pitch increase of 
l octave, which then stabilizes at the level of a half- 
octave gain over preoperative voice levels. However, 
in our study, because the lower pitch range declined 
more than the relatively stable upper pitch range, the 
most substantial change over time was a relative in- 
crease (81 Hz or 2 semitones) in the patients’ dynamic 
pitch range over time. 


Although this change over time may be explained 
in part by stretching of the laryngeal soft tissues 
placed under tension by the operation, it may also be 
related to the most common difficulty encountered 
after pitch-raising surgery for male-to-female trans- 
sexuals. Like Gross,?! we have found that after many 
years of speaking at an artificially elevated pitch, 
many of our patients produce voice with a consider- 
able degree of strain. This habit may be so ingrained 
that they have the equivalent of a muscle-tension dys- 
phonia and are unable to produce relaxed phonation. 
Consequently, some of our patients experience un- 
naturally elevated pitch after surgery and require ex- 
tensive voice therapy to counteract it. In fact, the abil- 
ity to voice at a pitch within the masculine range is 
now part of our preoperative consultation; those who 
are unable to do so are regarded as being suboptimal 
surgery candidates and may require preoperative 
voice therapy. 


It is our contention, then, that the presence of a 
persistent state of tension may skew the preoperative 
acoustic data of this patient population in a positive 
direction. The combination of additional tension pro- 
duced by the CTA surgery in association with this 
hyperfunction produces a reduced dynamic range 
during the first year after operation. However, by 24 
months after the surgery, a decrease in hyperfunction- 
al voice habits in association with some degree of 
relaxation (of the vocal folds or sutures) would pro- 
duce the results described above: a continued increase 
in FO over preoperative levels in association with an 
increased dynamic range. Of course, this explanation 
remains conjecture. 


The combination of these tension dysphonia be- 
havioral issues, the unknown degree to which stretch- 
ing occurs in the laryngeal soft tissues after surgery, 
and the mariy factors that contribute to the perception 
of feminine voice probably account for the lack of 
correlation between the subjective survey results and 





the objective measurements taken. Clearly, further 
information that accounts for these factors in the as- 
sessment of the long-term outcomes of patients after 
CTA is long overdue. Preoperative documentation 
of cricothyroid distance as a baseline for postopera- 
tive comparison will be helpful. 


Although our data are incomplete and a large per- 
centage of patients are not, or choose for confiden- 
tiality reasons not to be, available for follow-up, this 
review confirms our subjective impression that CTA 
can facilitate the acquisition of a feminine pitch. 
However, it should be seen as an integral part of an 
overall treatment protocol and not as a substitute for 
appropriate voice therapy, which is essential for suc- 
cessful outcomes. In the words of one of our survey 
respondents, “CTA enabled me to be able to do the 
things I learned in voice [therapy]. CTA allowed me 
to have a choice about the gender I am.” Consequent- 
ly, the importance of voice therapy before and after 
operation is iterated during our preoperative consulta- 
tion, and therapy is required of all patients. The limi- 
tations of the surgery in terms of its negative impact 
on the singing voice and the possibility of vocal 
roughness are emphasized, and professional voice 
users are strongly encouraged to consider alternative 
treatment methods for these reasons. A thorough 
voice evaluation including both laryngoscopy and 
acoustic measurement is critical in order to rule out 
vocal disease, which may be exacerbated by the sur- 
gery if not addressed before operation. Finally, at- 
tempts are made to connect the patient with resources 
in her local area that will facilitate the transition. Al- 
though follow-up is difficult for patients who are not 
from the local area, postsurgical visits are important 
to ensure continued vocal health. 


CONCLUSIONS 


1. At an average of 22 months after surgery, the 
20 patients in our study showed an increase in the 
pitch of their speaking voice of 6 semitones and an 
increase in their dynamic range of 2 semitones. No 
significant difference in perturbation was noted. 


2. The majority of patients were satisfied with the 
surgery and felt that their voices were more feminine 
and more appropriate to their self-image. However, 
subjective results did not correlate with objective 
measurements. 

3. Voice therapy is an essential part of treatment 


as part of a thorough protocol for optimum voice out- 
comes. 
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585-NM PULSED DYE LASER TREATMENT OF GLOTTAL 
PAPILLOMATOSIS 
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Treatment of recurrent respiratory papillomatosis of the glottis is often challenging. The surgeon and patient must cooperatively 
balance decisions regarding airway safety, effects of multiple general anesthestas, employment disturbance, and vocal dysfunction. 
A pilot study was done in 41 adult cases (23 patients; 78 vocal folds) without complication to evaluate the effectiveness of a 585-nm 
pulsed dye laser (PDL; 450-us pulse width; fluence of 38 to 255 J/cm?; 1- to 2-mm spot size) in the treatment of this disorder. Thirty- 
seven of the 41 cases (90%) were bilateral disease. Twenty-six of the 41 cases (63%; including 20 cases with involvement of the 
anterior commissure) were treated by bilateral photocoagulation of the lesions’ microcirculation without microflap resection of 
tissue. Clinical observation revealed that irradiated but unresected disease involuted without development of an anterior commissure 
web. In the initial 13 of the 41 cases (32%), PDL treatment was followed by cold instrument microflap resection. The PDL enhanced 
the epithelial excision by improving hemostasis and by creating an optimal dissection plane between the basement membrane and the 
underlying superficial lamina propria. The PDL at 585 nm was less effective in the management of exophytic lesions because of its 
limited depth of penetration (approximately 2 mm). In this initial trial, the PDL was a relatively safe and efficacious treatment for 
glottal recurrent respiratory papillomatosis. Since the lesions involute without complete resection of the diseased epithelium, the 
anterior commissure can be treated to minimize the number of procedures. To study patterns of recurrence will require longer follow- 


up. 
KEY WORDS — glottal papillomatosis, glottis, pulsed dye laser treatment, recurrent respiratory papillomatosis, vocal cord, 
vocal fold. 


INTRODUCTION rounding normal vocal fold structures that are asso- 
ciated with localized surgical management. Photody- 
namic therapy® has been employed with numerous 
photosensitizing agents to selectively target diseased 
epithelial cells. Cells that are HPV-infected preferen- 
tially concentrate the photosensitizer, as compared 
with the surrounding normal cells. The diseased epi- 
thelium sustains selective damage when the appro- 
priate light activates the photosensitive agent, caus- 
ing cellular destruction. Although this approach is 
elegant in principle, it has not yet proved effective 
enough to replace conventional ablative manage- 
ment. Clinicians have been hesitant to expand the 
role of photodynamic therapy because of concerns 
of hepatic and renal toxicity, as well as generalized 


Treatment of recurrent respiratory papillomatosis 
(RRP) has been an omnipresent laryngological prob- 
lem that has been challenging since the origin of the 
specialty! and remains so today. The viral source for 
this infection is human papillomavirus (HPV) type 
6 and/or 11.2 Although RRP is a benign neoplastic 
process, it is a potentially life-threatening disease that 
can obstruct the airway. Its recurrent behavior leads 
to repeated use of general anesthesia and the associ- 
ated morbidity. Glottal disease typically results in dys- 
phonia, with a variable detrimental effect on patients’ 
personal, social, and vocational lives, given our com- 
munication-based society. 


Microlaryngoscopic treatment of laryngeal RRP photosensitivity. Although newer photodynamic com- 
typically comprises carbon dioxide (CO2) laser abla- pounds are safer, can be administered topically, and 
tion.’ Despite the fact that the newer-generation CO2 have much shorter periods of generalized photosen- 
lasers have smaller spot sizes and improved deliv- sitivity (several days), their efficacy continues to be 
ery systems, there continues to be a significant zone inconsistent. 7° 


of thermal injury to the underlying superficial lamina 


propria and unavoidable loss of epithelial pliability.4> Anolherterapenie Meaimies model tias basn to 


target the virus replication with parenteral antiviral 
A variety of strategies have been devised to treat medications.” !0 There is optimism regarding the use 
RRP and minimize the deleterious effects on the sur- of cidofovir,!! which is reported to have enhanced 





From the Departments of Otology and Laryngology (Franco, Zeitels) and Dermatology (Farinelli, Anderson), Harvard Medical School, the 
Division of Laryngology, Massachusetts Eye and Ear Infirmary (Franco, Zeitels), and Wellman Laboratories of Photomedicine, Massachusetts 
General Hospital (Farinelli, Anderson), Boston, Massachusetts. 

Presented at the meeting of the American Broncho-Esophagological Association, Palm Desert, California, May 14-15, 2001. 
CORRESPONDENCE — Ramon A. Franco, Jr, MD, Dept of Otolaryngology, Massachusetts Eye and Ear Infirmary, 243 Charles St, Boston, MA 
02114. 


486 











Franco et al, 585-nm Pulsed Dye Laser Treatment of Glottal Papillomatosis 487 


efficacy as compared with other antiviral agents. Un- 
fortunately, the therapeutic regimen requires repeated 
microlaryngoscopic administration at frequent inter- 
vals to achieve involution of the lesions. Therefore, 
the patient is subjected to multiple general anesthesias 
— possibly more than conventional surgical manage- 
ment would dictate. 


Phonomicrosurgical cold instrument techniques!” 
that are designed to resect the papillomatous epithe- 
lium en bloc‘ are effective at removing visible dis- 
ease, but probably fail to eradicate the disorder be- 
cause of latent virus in the “normal-appearing” epi- 
thelium at the lesion’s perimeter.!3 As the new epithe- 
lium regenerates, it becomes reinfected by the endog- 
enous reservoir of virus from the surrounding muco- 
sa.}4 Staged resections are necessary if there is dis- 
ease located on the medial surfaces of the anterior 
commissure, because of the potential for web forma- 
tion.!5 


Other approaches that have been attempted with 
varying success include cauterization,!® ultrasound 
therapy, topical chemotherapy,!” steroids, and autoge- 
nous vaccines.!8 The shortcomings of the aforemen- 
tioned methods catalyzed the search for a phonomi- 
crosurgical treatment strategy that would eradicate 
the diseased epithelium (both visible and asymptom- 
atic regions) and minimize trauma to the vocal folds’ 
layered microstructure. 


Since papillomatous lesions are benign angioma- 
tous neoplasms with a microvascular core, a surgi- 
cal angiolysis model of tumor control is philosophi- 
cally well suited. In turn, the pulsed dye laser (PDL) 
at 585 nm was chosen on the basis of the properties 
of its design and function by Greenwald et al!? and 
Anderson and Parrish.?° Similar to the dermatologic 
model, it was theorized that the microvasculature 
could be targeted to involute the disease while mini- 
mizing trauma to the surrounding soft tissue. In theory, 
this means would be ideal for maintaining pliability 
of the vocal folds’ layered microstructure and glottal 
sound production. 


A review of the dermatologic and otolaryngologi- 
cal literature revealed limited experience with the 585- 
nm PDL in the treatment of cutaneous and laryngeal 
papillomas.?!-24 The 585-nm PDL energy penetrates 
epithelium without damaging it, and is selectively ab- 
sorbed by the underlying microvasculature. !9:20.25-27 
Its early uses were for the management of cutaneous 
vascular lesions such as port-wine stains and capil- 
lary hemangiomas.?8 The PDL is now the treatment 
of choice for many vascular lesions and is currently 
used for telangiectasias, hypertrophic scars, keloids, 
and striae.29 


McMillan et al? and Valdez et al3° used the PDL 


to treat laryngeal papillomatosis in 13 patients. Be- 
fore treating humans, they performed a canine laryn- 
geal study, which revealed no acute or delayed epi- 
thelial damage or fibrosis due to PDL therapy on the 
vocal folds. When they used the PDL in humans for 
laryngeal papillomatosis, there was complete regres- 
sion of disease without vocal fold epithelial disrup- 
tion. These results were corroborated by Bower et 
al,24 who used the 585-nm PDL in 9 cases of laryn- 
geal papillomatosis. The PDL-treated vocal folds had 
disease eradication comparable to that of CO2 la- 
ser—treated folds. Because the epithelium can be pre- 
served with the PDL (noncontact delivery of laser 
energy to the tissues), anterior commissure lesions 
can theoretically be treated bilaterally, decreasing the 
number of procedures that an individual would un- 
dergo without the risk of creating a web. 


MATERIALS AND METHODS 


A Photogenica V 585-nm PDL (Cynosure Inc, 
Chelmsford, Massachusetts) was used to photocoagu- 
late the vocal fold microvasculature (at 450-us pulse 
width, 5 J per pulse maximum output of 1 Hz, 1-mm 
fiber, 1- to 2-mm spot size, 38 to 255 J/cm? fluence). 
The study protocol was approved by the human stud- 
ies institutional review board of the Massachusetts 
Eye and Ear Infirmary. Patients older than 16 with 
symptomatic glottal papillomatosis requiring surgi- 
cal intervention were given the option of standard 
therapy (cold instrument resection and/or CO2 laser 
ablation) versus PDL treatment. Twenty-three pa- 
tients with glottal papillomatosis were treated with 
the 585-nm PDL from July 1999 through March 
2001. 


After the induction of general anesthesia, the Uni- 
versal Modular (UM) glottiscope3!.32 was used to al- 
low access to the true vocal folds. The microlaryngo- 
scopic technique has been described previously.33-35 
Universal laser precautions were followed at all times, 
including protective eyeglasses for the entire opera- 
tive team, wet eye-pads and overlying towels on the 
patient’s face, and a wet cottonoid protecting the en- 
dotracheal tube cuff. A straight cleaved, 1-mm core 
diameter, fused silica fiber was placed in the lumen 
of a laryngeal suction cautery and inserted through 
the uniquely designed lateral slots?!-32 of the UM glot- 
tiscope to within 2 mm from the surface of the true 
vocal folds. The output of the laser was typically set 
to 500 to 570 mJ per pulse (range, 480 to 800 mJ per 
pulse). The treatment end point was photocoagula- 
tion of the microcirculation as evidenced by intralu- 
minal thrombus formation and darkening of the ves- 
sels. The fiber was not a fixed distance from the tis- 
sues, but varied between contact and 2 mm, such that 
fluences ranged from 38 to 255 J/cm2. Fiber-to-tis- 
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TABLE 1. OVERVIEW OF ALL CASES IN WHICH 
PULSED DYE LASER WAS USED _ 





Anterior 
Commissure With Without 
Cases Involvement Resection Resection 





No. % No. % No. % No. & 
Unilateral 4 10 2 5 2 5 
Bilateral 37 90 27 66 Il 27 26 63 
Total 41 27 66 13 32 28 68 


There were 41 cases in which pulsed dye laser photocoagulation of 
superficial lamina propria microcirculation was performed. Earlier 
in series, epithelial lesions were resected by means of epithelial 
microflap; however, majority did not undergo formal resection. 


sue distance was one means of adjusting the fluence 
to effect the treatment end point. Supraglotta! disease 
was treated primarily by means of CO2 laser abla- 
tion, because the associated thermal injury is of no 
clinical consequence and because the PDL was im- 
practical for large and exophytic lesions. In those 
selected cases in which resection was performed, the 
tissues were photocoagulated with the PDL before 
resection. 
RESULTS 

Twenty-three patients (12 men and 11 women) 
underwent 41 surgical procedures on 78 vocal folds 
over a period of 19 months with the use of the PDL. 
Each subject underwent a mean of 1.78 surgeries (SD, 
0.74; range, 1 to 3 surgeries) for control of the RRP 
with the 585-nm PDL. The mean number of pulses 
from the 585-nm PDL during each surgical case was 
477 pulses per case (SD, 526: range, 8 to 1,956 pulses 
per case). The mean energy output from the laser was 
549 mJ per pulse (SD, 74.5; range, 480 to 800 mJ 
per pulse). The fluences varied from a mean of 186 
J/cm? (SD, 73; range, 153 to 610 J/cm?) at 1 mm from 
the tissues to a mean of 44 J/cm? (SD, 6; range, 38 to 
64 J/cm?) at 2 mm. 


TABLE 2. OVERVIEW OF CASES IN WHICH 
BILATERAL DISEASE WAS PRESENT _ 


With Without 
Cases Resection Resection 
No. % _No. % No. X 


Anterior commissure 


involved 27 Ts 7 19 20 54 
Anterior commissure 

not involved 10 27 4 1] 6 16 
Total 37 11 30 26 70 


Breakdown of 37 cases with bilateral papillomatosis based on rela- 
tionship to anterior commissure. Percentages shown are based on 
denominator of 37 (bilateral involvement). 


Tables | and 2 delineate the geographic distribu- 
tion of the disease with relation to the anterior com- 
missure, and whether phonomicrosurgical microflap 
resection was done after PDL irradiation. Photoco- 
agulation of the microvasculature of both vocal folds 
was done without resection in 26 of the 37 cases of 
bilateral disease (70%), resulting in total or near-to- 
tal involution of disease without anterior commissure 
web formation. In cases with large exophytic lesions, 
some of the exophytic component was removed, leav- 
ing islands of epithelium at the lesions’ base (Fig 1). 
In the 13 cases in which epithelium was removed, 
there was minimal bleeding associated with the resec- 
tions after PDL therapy (Fig 2). There were no anes- 
thetic or surgical complications. 


DISCUSSION 


The use of the 585-nm PDL for glottal papilloma- 
tosis was effective in enhancing the eradication of 
disease without the clinically observed soft tissue 
complications associated with the CO2 laser?6 (ther- 
mal damage, tissue necrosis, superficial lamina pro- 
pria scarring, anterior commissure web formation). 
The presumed mechanism of papilloma regression 
is the selective destruction of the subepithelial mi- 





Fig 1. Endoscopic views (original x4; courtesy of Endocraft LLC, Providence, Rhode Island). A) Massive papillomatous 
changes are seen in glottal introitus. B) Pulsed dye laser is used to ablate microvasculature throughout anterior commissure 
region with partial cytoreduction of disease. Islands of epithelium remain from base of exophytic disease to preclude anterior 
commissure web formation. 
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Fig 2. Endoscopic views (original x7; courtesy of Endocraft LLC). A) Bilateral sessile papillomatous masses are seen on 
medial surface of both vocal folds. B) Fiber of pulsed dye laser is directed at left lesion to coagulate subepithelial microvascu- 
lature. C) Left-sided lesion has been resected. Microvasculature of right lesion has been coagulated by laser, which is well 
seen as fold is retracted. D) Diseased epithelium can be resected as precise microflap, since microvasculature of superficial 
lamina propria is coagulated. Some subepithelial ecchymosis can be seen in right vocal fold. 


crovasculature with resulting ischemia to the diseased 
epithelium.?! Cells whose nutritional demand is high- 
er than that of the surrounding normal cells because 
of growth from HPV infection would be more sensi- 
tive to a decrease in the microcirculation. Albeit not 
permanent, this microsurgical “angiolysis” approach 
restricts survival and growth of neoplastic epithe- 
lium while minimizing cytotoxicity to the delicate 
layered microstructure of the vocal fold. Tan et al?! 
and others37-38 have demonstrated that tissue injury 
caused by PDL is confined to the perivascular re- 
gion, with thrombus formation within the vessels, 
endothelial cell necrosis, and red blood cell extrava- 
sation, and does not disturb the surrounding struc- 
tures (Fig 1). Studies by Valdez et al,3° McMillan et 
al,2? and Bower et al?4 have demonstrated that the 
PDL can be used relatively safely on laryngeal struc- 
tures at similar laser pulse energies (present study, 
549 mJ per pulse; Valdez et al study, 572 mJ per pulse; 
Bower et al study, 392 to 589 mJ per pulse; McMillan 
et al study, 342 to 572 mJ per pulse) to effect clinical 
regression of papillomatosis. 


Nearly every subject in this study had undergone 
multiple procedures to control his or her RRP. It is 


impossible to predict when disease recurrence will 
require operative intervention, since the behavior of 
laryngeal papillomatosis is extremely variable, both 
between patients and over time in individuals. Papil- 
lomatous lesions have been known to spontaneously 
regress, both on the skin and in the larynx. Assess- 
ing new technology is difficult because of this highly 
variable pattern. Operative intervention does not nec- 
essarily correlate directly with the recurrence of dis- 
ease, but is influenced by such non-disease factors 
as the surgeon’s operative philosophy and skill, as 
well as the patient’s vocal needs and ability to sched- 
ule time away from work. It is evident that PDL treat- 
ment of papilloma is not curative and that the true 
disease-free interval after PDL treatment can only 
be assessed with continued surveillance of these pa- 
tients. 


It is readily apparent that PDL treatment offers re- 
lief of tumor burden and minimizes the attendant 
long-term consequences of vocal fold scarring that 
results from repeated surgical ablative procedures 
(CO2 laser or cold instrument) required by dissec- 
tion within the superficial lamina propria. Because 
the epithelium is not fully removed, bilateral treat- 
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Fig 3. Photomicrograph displays pulsed dye laser—in- 
duced cleavage plane between epithelial basement mem- 
brane and superficial lamina propria. Vascular core of 
papillomatous lesion is identified with intravascular co- 
agulation of blood. Notice that epithelial cytoarchitec- 
ture has not been altered, despite effective laser treat- 
ment. 


ment of disease is possible. Anterior commissure dis- 
ease (medial or internal surface), which is usually 
treated with staged operations, can be treated simul- 
taneously, and thus the number of general anesthetic 
procedures, with their attendant morbidity and cost, 
is minimized. 


Clinical observation at the time of PDL therapy re- 
vealed that there were interesting and useful responses 
of the vocal folds’ layered microstructure to the la- 
ser energy. A cleavage plane developed at the epithe- 
lial basement membrane in those areas in which the 
PDL was used, facilitating an ultraprecise cold instru- 
ment resection if that therapeutic pathway was cho- 
sen (Fig 3). This phenomenon has not been identi- 
fied in PDL-treated skin. It may be attributable to de- 
naturation of basement membrane linking proteins 
that anchor the epithelium to the superficial lamina 
propria.*? In addition, there was enhanced microsur- 
gical precision due to improved hemostasis. Because 
the microvasculature was photocoagulated, there was 
optimal visualization of the highly magnified tissue 
planes of the superficial lamina propria. 


An anecdotal observation was that a majority of 
subjects spontaneously reported that their voices im- 
proved significantly more than they had after previ- 
ous CO2 or phonomicrosurgical cold instrument pro- 
cedures. In fact, many thought that their vocal qual- 
ity approached their “normal” (pre-disease) state af- 
ter the vocal fold edema and purpura resolved (>2 to 
3 weeks). The mechanism for this “improvement” 
in the voice is putative, but unexplained. It may be 
attributed to the possible rearrangement of collagen 
fibers that results in softening of superficial lamina 
propria scar. Bower et al?* also noted this effect of 
PDL treatment. 


A carefully designed study is under way to objec- 
tively analyze the vocal results from this heteroge- 
neous population with varied numbers of prior inter- 
ventions and variable disease characteristics (ie, geo- 
graphic location). Understandably, disease on the 
medial or vocalizing surface of the true vocal folds 
will result in greater dysphonia than disease on the 
superior and lateral surfaces of the true vocal folds. 
Because of this variability, historical controls will 
not apply to these patients; each individual will serve 
as his or her own control. 


The PDL is currently being used in dermatology 
to soften cutaneous scar’; it is thought to induce col- 
lagen remodeling. This same beneficial effect might 
take place in the vocal folds, in which increased pli- 
ability would translate into improved mucosal wave 
propagation and a reduction in dysphonia. A mam- 
malian study has been designed to examine the ef- 
fects of PDL energy on vocal fold collagen to eluci- 
date this most interesting clinical observation. It is 
interesting to note that when the laser energy was 
applied to normal-appearing epithelium, there were 
no observable deleterious effects on the tissues. The 
properties of the PDL and its various delivery sys- 
tems theoretically enable its use in the outpatient set- 
ting, and so would obviate the need for general an- 
esthesia. 


Despite the assets of PDL treatment, there are 
shortcomings to this technology. The laser is expen- 
sive, and is not likely to be present in most surgical 
suites. From this study, it is apparent that PDL therapy 
is less effective in the treatment of exophytic versus 
sessile lesions because of the superficial penetration 
(approximately 2 mm) of the laser energy. This out- 
come may be an artifact of the relatively low power 
settings used in this study. The Photogenica V laser 
is capable of 5 J per pulse; the typical power output 
in this study was 0.55 J per pulse. It is likely that 
exuberant exophytic lesions require an increase in 
power for the laser energy to reach the vasculature 
at the core of the papillomatous lesions. This ques- 
tion will be examined in future studies. Increasing 
the laser’s wavelength would allow deeper penetra- 
tion into tissues and would be beneficial in cases of 
exuberant papillomatosis. 


In this study, lesions that were perpendicular to 
the axis of the straight cleaved fiber were optimal 
for effective irradiation. Medial-surface lesions re- 
quired significant retraction of the vocal fold. The 
inner aspect of the anterior commissure would be 
better treated with a side-firing tip. A delivery sys- 
tem that could be set to several angles (30°, 45°, 90°) 
would help to ensure proper irradiation of difficult 
areas such as the medial surfaces of the vocal folds 
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and the subcordal anterior commissure region. 


In its present configuration, the Photogenica V la- 
ser operates at 1 Hz. The PDL takes much more time 
than the CO2 laser to treat a similar area of disease. 
Supraglottic disease was treated with the CO2 laser, 


. because scarring here does not lead to dysphonia, 


and the larger-volume disease was impractical to treat 
with the current PDL model (ie, treatment would take 
too long). There are plans to increase the frequency 
to 2 Hz; this change would decrease the time under 
anesthesia. This increased frequency is especially im- 
portant when several thousand pulses are to be deliv- 
ered. Increasing the repetition rate, as well as the spot 
size (to 3 to 5 mm per pulse) and the laser energy 
(fluence), should decrease future treatment times. 


Because of the vascular injury, there is a charac- 
teristic extravasation of erythrocytes into the perivas- 
cular spaces. This purpura reflects the associated ede- 


ma of the superficial lamina propria, which results 
in postoperative dysphonia for 2 to 3 weeks. No sub- 
ject has described degradation in vocal quality sub- 
sequent to PDL treatment, and most reported a no- 
ticeable improvement in their dysphonia.”4 


Future treatment strategies of papillomatosis will 
probably target the virus itself with agents designed 
to interfere with key replicative and custodial viral 
activities. Ideally, treatment that does not have sys- 
temic toxicity or deleterious local permanent effects 
is desirable. Development of a surgical treatment ap- 
proach that does not require general anesthesia would 
help reduce the morbidity, and at times mortality, as- 


` sociated with glottal papillomatosis. With technologi- 


cal improvements and an increased appreciation for 
the capabilities of PDL therapy, it is hoped that this 
treatment paradigm can be transferred from the oper- 
ating room to a clinic setting. 
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SURGICAL VOICE REHABILITATION AFTER LASER SURGERY FOR 
GLOTTIC CARCINOMA 
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Transoral laser surgery today is the mainstay of treatment for T1 and T2 glottic carcinoma. The vocal ability remains sufficient 
in the majority of patients. However, in some cases, a significant glottic gap may persist, leading to poor voice quality. We report a 
special technique of medialization thyroplasty using autologous cartilage specifically adapted for vocal rehabilitation after laser 
resection. Six patients with a significant glottic gap following laser surgery were treated. For vocal rehabilitation, a special medialization 
technique was performed. The superior rim of the thyroid cartilage of the resected side was exposed. A 1 x 2-cm piece of cartilage 
was harvested and reimplanted into a subperichondrial pouch created on the inner side of the thyroid cartilage. When phonation was 
optimal, this cartilaginous strut was sutured and/or glued in place. In all 6 cases, the vocal function improved significantly. The 
dysphonia index (0 = normal, 3 = aphonia), which includes objective parameters as well as expert voice ratings and the patient’s 
perception, increased by 1.1 on average (range, 0.4 to 1.6). The results have been lasting. The established medialization techniques 
are of limited value in a larynx scarred by laser surgery. Injection augmentation is often futile because the tight scar tissue does not 
lend itself to augmentation. Implantation of nonorganic material may cause problems if revision surgery for tumor recurrence should 


become necessary or if the implant protrudes into the scarred endolarynx. The technique reported avoids these pitfalls and leads to 


voice quality improvement comparable to that of established medialization procedures. 


KEY WORDS — laser surgery, medialization thyroplasty, voice rehabilitation. 


INTRODUCTION 


Nowadays, transoral laser surgery (TLS) is one of 
the preferred treatment options for T1 and T2 glottic 
carcinoma in most centers in Europe, as well as in 
the United States.!,2 Numerous studies’ data on local 
contro]; organ preservation, and survival have proven 
that the results of this treatment method are equal or 
even superior in terms of safety, efficacy, and cost- 
effectiveness to those of radiotherapy or conventional 
partial laryngectomy.?-8 The question of postopera- 
tive vocal function has been addressed much less fre- 
quently. In general, the phonatory function after laser 
cordectomy is believed to be tolerable, since the ma- 
jority of patients regain their ability to communicate 
effectively.? While open partial laryngectomy leads 
to comparatively worse results in terms of vocal func- 
tion,!° radiotherapy seems to have less impact on 
voice quality.!°.! However, TLS sometimes undoubt- 
edly leaves patients with poor vocal ability due to 
insufficient glottic closure. In contrast to glottic in- 
competence caused by paralysis, in postsurgical cases, 
the commonly used vocal fold medialization tech- 
niques often lead to unsatisfying results because of 
the scarred endolaryngeal tissue. We present the ap- 


plication of a special medialization technique for 
voice rehabilitation and its preliminary results in a 
selected group of patients. 


SUBJECTS AND METHODS 


Patient Population. Surgical voice rehabilitation 
was offered to patients if the following criteria for 
eligibility were fulfilled: 1) a significant glottal gap 
on videostroboscopy; 2) unsatisfying voice improve- 
ment after intensive speech therapy; 3) absence of 
any disease in the resected area, ie, leukoplakia, in- 
flammation, or granulation; 4) an interval of at least 
12 months without any signs of recurrence between 
TLS and surgical voice rehabilitation; and 5) signifi- 
cant dissatisfaction of the patient with his or her abil- 
ity to communicate in social and/or professional life. 


Under these guidelines, 6 patients opted for sur- 
gery. When we obtained informed consent, it was 
outlined to every patient that the surgery planned was 
not a standard procedure and that voice improvement 
was not guaranteed. All patients had undergone laser 
cordectomy for Tla (5 patients):or T2 (1 patient) 
squamous cell carcinoma of one vocal cord. A mean 
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Fig 1. Donor site of autolozous cartilage graft. 


of 16.8 months had passed between TLS and surgical 
voice rehabilitation. Follow-up visits performed on 
a regular basis had ruled out tumor recurrence. All 
patients were male, and the mean age was 56.6 years 
(range, 36 to 78 years). 

Surgical Technique. Four patients were operated 
on under local anesthesia, and 2 under general anes- 
thesia. In all cases, a fiberoptic laryngoscope was 
used to control the endolaryngeal situation. The flex- 
ible endoscope was inserted through the nose after 
surface anesthesia in patients operated on under local 
anesthesia. For general anesthesia, a laryngeal mask 
was used, which allowed for insertion of a flexible 
laryngoscope. No tracheostomy was carried out in 
any of the cases. A horizontél neck-crease incision 
was made at the level of the mid-thyroid cartilage on 
the side of the laser resection. The strap muscles were 
retracted laterally or partially cut in a horizontal plane 
to expose the thyroid ala. The perichondrium was 
incised along the upper edge of the thyroid cartilage 
and mobilized inferiorly to the level of the thyroid 
notch on both external and internal surfaces of the 
cartilages. From the upper rim of the thyroid cartilage, 
an almost rectangular piece cf approximately 1 cm 
in height and 2.5 cm in length was harvested (Fig 1) 
with an oscillating jigsaw (“asteoscalpel”). A small 
Roux retractor was positioned parallel to the inner 
aspect of the thyroid cartilage to prevent accidental 
damage to the paraglottic space during this task. Care 
was taken not to include the thyroid notch into this 
resection to prevent an asymmetric appearance (es- 





Fig 2. Preparation of pocket in subperichondrial space 
from superior direction. 
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Fig 3. Cartilaginous strut placed in subperichondrial space, 
forcing scarred site of laser resection into more medial po- 
sition. 


pecially for male patients). Elevation of the inner 
perichondrium was performed with utmost caution 
to prevent puncturing the scarred laser-ablated endo- 
larynx (Fig 2). In all cases, a sufficient pocket for 
the cartilage graft could be created without accidental 
entering of the larynx. The cartilage graft was trimmed 
individually and placed into the prepared pocket (Fig 
3). When surgery was performed under local anes- 
thesia, the patient was asked to phonate in order to 
reveal the optimal graft position. In every case, fiber- 
optic control was used to monitor the medialization 
process. In most cases, no additional fixation was 
necessary, because of the tight fit of the graft in its 
pocket. If the cartilage strut seemed to be somewhat 
mobile, Z anchoring sutures were laid through the 
graft and the adjacent thyroid cartilage, or the graft 
was sealed in its position with fibrin glue. 


Diagnostic Tools. Two speech therapists and a 
trained laryngologist independently rated each voice 
before and after the operation. Voices were evaluated 
according to the GRB scale (grade of hoarseness, 
roughness, breathiness), which has been described 
in detail elsewhere. !2 l 


Videostroboscopy was performed on every patient 
to reveal the mechanism of phonation, and the session 
was tape-recorded to allow for repeat analysis. 


A simultaneous registration of both pitch and in- 
tensity range (defined as the difference between max- 
imal and minimal sound pressure levels recorded at 
a 30-cm microphone distance) was produced in every 
case. For this procedure the terms “phonetogram” 
and “voice range profile” have been coined. The tech- 
nique, validation, and clinical use have been de- 
scribed elsewhere. }3.!4 The resulting graph gives com- 
plex information that cannot be expressed as a nu- 
meric variable; therefore, comparisons or statistical 
operations are not feasible. To overcome this short- 
coming, we extracted 2 characteristic parameters 
from each phonetogram: 1) the maximal dynamic 
range in decibels, which is the difference between the 
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TABLE 1. DEFINITIONS FOR DYSPHONIA INDEX 


Variable Scores 

Grade of hoarseness (score) 0 1 2 3 
Maximal dynamic range (dBA) >24 18-24 12-17 <12 
Pitch range (halftones) >45 35-45 25-34 <25 
Maximal phonation time (s) >15 11-15 7-10 <7 
Communicative ability (score) 0 1 2 3 


To calculate dysphonia index, add scores of each item and divide by 
5. See text for further explanations. 














loudest and the softest tones a patient was able to pro- 
duce, and 2) the number of halftones a patient could 
sing after the tones were presented from a piano. 


In addition, the maximal phonation time for the 
sustained vowel /a/ was measured in seconds. The 
ability to communicate effectively was assessed with 
a standardized score system. In this system, a com- 
pletely unimpaired communication was coded as 0. 
Minor impairment under effort, but not in everyday 
social communication, was recorded as 1. A 2 was 
given for voices that showed significant restrictions 
under minor effort, but allowed for everyday social 
communication. Finally, a score of 3 represents mas- 
sively impaired vocal function with a negative impact 
on social communication. 


The data of these variables are summarized in a 
dysphonia index by adding score points for each pa- 
rameter (Table 1). For the parameters grade of hoarse- 
ness and communicative ability, the scoring system 
outlined above is used. If more than 24 halftones can 
be produced by the patient, the result is coded as a 
score of 0. Between 18 and 24 halftones yields a score 


of 1, between 12 and 17 halftones yields a score of ` 


2, and fewer than \12 halftones yields a score of 3. 
Accordingly, a maximal dynamic range of more than 
45 dB is given a score/of 0; 35 to 45 dB a score of 1; 
25 to 34 dB a score of 2; and less than 25 dB a score 
of 3. A maximal phonation time of more than 15 sec- 
onds is scored as 0; 11 to 15 seconds is scored as 1; 7 
to 10 seconds is scored as 2; and less than 7 seconds 
is scored as 3. For calculation of the dysphonia index, 
the scores of each parameter are added and divided 
by 5 (as 5 variables are involved). In this system, a 
dysphonia index of 0 represents a normal voice, and 
a dysphonia index of 3 represents an extremely dis- 
turbed voice.!5 


RESULTS 


Before surgical voice rehabilitation, the mean dys- 
phonia index of all 6 individuals was 2.67. After the 
operation, the dysphonia index had improved by 1.14, 
to an average of 1.53. The preoperative and postop- 
erative data for each patient are listed in Table 2. 
Figure 4 gives an endoscopic view of the preoperative 
state of the larynx of a patient, and Fig 5 depicts the 
postoperative result. 


DISCUSSION 


Voice results following laser treatment of the 
larynx have been the subject of discussion since this 


. method was introduced in the late 1970s. Whereas 


radiotherapy has a good reputation in terms of voice 
preservation,!®!8 surgical treatment is suspected to 
be more mutilating. However, objective data have 
shown that radiotherapy does not automatically lead 
to a near-normal voice,!? and some authors have 


TABLE 2. INDIVIDUAL PREOPERATIVE AND POSTOPERATIVE DATA 








Patient 1 Patient 2 


Patient 3 


Patient 4 Patient 5 Patient 6 





Preop Postop Preop Postop Preop Postop Preop Postop Preop Postop Preop  Postop 





Variable 


RD Sc RD Se RD Sc RD Se RD Sc_RD Sc_ RD Sc RD Se RD Sc RD Sc_RD Sc RD Sc 





Grade of 

hoarseness 

(score) 2 2 1 1 3 3 1 1 3 3 
Maximal 

dynamic 

range (ABA) 17 1 16 1 7 3 17 1 23 1 
Pitch range 

(halftones) 14 317 3 7 3°17 3 33 2 
Maximal 

phonation 

time (s) 2 3 4 3 9 2 9 210 2 
Communica- 

tive ability 

(score) 32) 3 2 22.3 3 2 2 3 3 
Dysphonia 

index 2.4 2.0 2.8 1.8 2.2 


13 3 2s 2. Bk BP A 3 SD 


0.8 2.8 1.8 2.6 1.4 3 14 


RD — raw data, Sc — score. Patient 1 had T2 glottic carcinoma; other patients had T1 glottic carcinoma. Respective ages (years) of patients 1 
through 6 were 65, 36, 54, 78, 58, and 49. Intervals (months) between transoral laser surgery and surgical voice rehabilitation in patients 1 


through 6 were, respectively, 14, 18, 12, 22, 21, and 14. 
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Fig 4. (Patient 1) Preoperative endoscopic view. Note re- 
sected anterior commissure and partially resected false 
vocal fold on right side. 


found the voice results following laser treatment to 
be very similar.*°-?> Our own data on the phonatory 
results following TLS revealed that satisfactory voice 
production was regained in the majority of cases.’ 
Only in a small fraction of T1 and T2 glottic carci- 
noma patients did severe dysphonia persist. These 
observations raise the question of what the factors 
are that can adversely influence the postoperative 
voice result. Koufman?¢ made an observation (based 
on experience and not on clinical data) that web for- 
mation is the most important negative factor, fol- 
lowed by preoperative radiotherapy. According to 
Koufman’s observation, dissection of the anterior 
commissure is much more damaging to the vocal abil- 
ity than is removal of a large portion of the vocalis 
muscle, or even resection of a vocal process. This ob- 
servation found support in our investigations, which 
showed that involvement of the anterior commissure 
has a greater impact on vocal ability than does the 
extent of resection. 


01.07.99 
14:45:04 


3 UO POST-OP 


Fig 5. (Patient 1) Postoperative endoscopic view. Note 
supraglottic bulk on right side causing almost complete 
glottic closure. 








08.11.99 
12:45:58 


14 WO POST OP TYP IIIB 


Fig 6. (Patient 3) Endoscopic view 14 weeks after trans- 
oral laser surgery. Note absence of scar formation on re- 
sected left side at glottic level. 


From our observations, the individual pattern of 
scar formation is another key factor. While some pa- 
tients build up a pseudocord with effective glottic clo- 
sure, in other cases, scar formation is only minimal. 
Since the surgical technique was identical in all these 
cases, individual factors seem more likely to be in- 
fluential. Although uneventful wound healing with 
little tendency toward scar formation is desired after 
surgical interventions in all areas of the human body, 
after TLS, some scarring, at least, seems to be bene- 
ficial because of its bulk effect in the laser-ablated 
larynx. Figure 6 shows a good example of a complete 
absence of scar formation and persistence of a severe 
glottic gap 14 weeks after complete laser cordectomy. 


The indication for surgical voice rehabilitation 
should be based primarily on the patient’s requests 
and needs.’ Of course, all of the medical criteria out- 
lined above must be fulfilled for a patient to be eli- 
gible for such a procedure. If a patient perceives his 
or her vocal function as sufficient, no attempt should 
be made to direct him or her toward surgery, even if 
the phonatory function should objectively be poor 
and the chances for improvement by surgery might 
seem good. Conversely, requests for surgical voice 
rehabilitation should be rejected if the criteria of eli- 
gibility are not fulfilled. Our algorithm for patient 
selection usually starts with spontaneous complaints 
by the patient. Typically, the patients asks in one of 
the follow-up visits whether his or her voice “will 
remain so poor forever.” If endoscopic assessment 
reveals a favorable situation, the possibility of surgi- 
cal voice rehabilitation is suggested to the patient in 
detail. 


From an oncological standpoint, surgical manipu- 
lation in an organ recently treated for malignant dis- 
ease is at least debatable. We believe that a follow- 
up period of at least 12 months should be completed 
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before surgical interventions for functional improve- 
ment can be considered. This threshold, although set 
arbitrarily, is derived from many observations sug- 
gesting that recurrences of glottic carcinoma usually 
occur within the first year after treatment.46 Never- 
theless, the chance of tumor recurrence after surgical 
voice improvement must be dealt with in planning 
such a procedure. Dysphonia due to insufficient glot- 
tic closure can be treated very effectively by either 
approximation laryngoplasty?8 or endolaryngeal aug- 
mentation. In terms of long-term results, alloplastic 
materials like silicone, hydroxylapatite, or titanium 
are superior to autologous implants, which are subject 
to resorption to an unpredictable degree. On the other 
hand, in case of recurrence, autologous cartilage has 
several advantages. Medialization is performed from 
an external approach; thus, there is no alteration of 
the original tumor site that might prevent early detec- 
tion of recurrent disease. If recurrence does occur, 
revision surgery can be performed relatively safely 
in the presence of cartilage that was implanted sub- 
mucosally. In contrast, implantation of foreign mate- 
rial for medialization, although very effective in vocal 
fold palsy,!5.29-32 would probably require explanta- 
tion before revision surgery. For the same reason, in 
laryngeal cancer patients, we would be very reluctant 
to use a nonresorbable and nonexplantable material 
such as silicone; which otherwise represents an al- 
most ideal substance for injection augmentation.33 
In theory, collagen injections could be a fast and safe 
alternative for correction of glottic insufficiency. 
However, after occasional use, we found this tech- 
nique not useful for voice rehabilitation after TLS. 
Since the injection site in a laser-resected larynx con- 
sists of tight scar tissue, it is impossible in most cases 
to deposit sufficient amounts of collagen, which flows 
back easily through the puncture hole because of its 
low viscosity. Building up a sufficient bulk, including 
the required overcorrection, was hardly ever accom- 
plished in our hands. The same restrictions also apply 
to autologous fat augmentation. 


The surgical procedure presented in this paper is 
derived from a technique for external vocal fold me- 
dialization described by Tucker.» A technique quite 
similar to our own, except for some details, is used 
by Remacle et al.5 In contrast to the Isshiki type I 
thyroplasty,>° in our technique the subperichondrial 
space is approached from a superior direction, instead 
of through a window in the thyroid cartilage. This 
approach allows for harvesting of considerably more 
cartilage from the free upper rim of the thyroid carti- 
lage. In addition, the wide exposure of the inner peri- 
chondrium decreases the risk of entering the larynx, 
which is of special importance in the laser-ablated 
and therefore potentially scarred paraglottic space 


after TLS. In this respect, monitoring the endolarynx 
through a fiberoptic laryngoscope is also an addi- 
tional safety feature. The ensuing step in the Tucker 
technique consists of trimming a cartilage wedge into 
a triangular shape and implanting this graft in the 
prepared pocket. This step was not feasible in our 
cases, because the sizes and shapes of the paraglottic 
pouches varied widely, depending on the individual 
site and extent of resection, as well as local scar for- 
mation. After its creation, the paraglottic pocket was 
filled with cartilage, with little regard to the size or 
shape of this wedge. If truly paraglottic medialization 
could not be accomplished, the cartilaginous implants 
were placed deliberately at a more supraglottic level 
in order to build up as much endolaryngeal bulk as 
possible. Tucker recommends temporary tracheos- 
tomy for this procedure. We did not carry out trache- 
ostomy in any of the cases presented, yet no airway- 
related problems were encountered. Basically, the op- 
eration is comfortable for the patient under local anes- 
thesia. However, because of the quite extensive prep- 
aration needed, general anesthesia may be regarded 
as advantageous in many cases. For those cases, we 
prefer the use of a laryngeal mask for ventilation, 
since it allows an undisturbed view of the open glot- 
tis.37 Since the maximum amount of medialization 
is usually limited by factors such as scar formation 
or impending rupture of the inner laryngeal lining, 
the role of phonatory feedback for optimizing the 
implant position is of much less importance in this 
group of patients than in ordinary thyroplasty. Al- 
though we still prefer local anesthesia, we consider 
general anesthesia only slightly less advantageous in 
this procedure. 


So far, no patient has been implanted for longer 
than 2 years, so long-term results are difficult to as- 
sess. Up to now, voice improvement has been lasting 
and stable in all cases. In theory, the mechanical stress 
induced by phonatory vibrations might lead to a re- 
duction of cartilaginous bulk. Further investigation 
is warranted to assess the real risk of cartilage decay. 
Whether the theoretical option of revision surgery 
for reimplantation of cartilage in the case of resorp- 
tion would actually be feasible cannot be answered 
today, as revision surgery for that purpose has not 
yet been performed. 


CONCLUSIONS 


Surgical voice rehabilitation can be a relatively 
safe and successful treatment option for those few 
patients who have severe dysphonia after TLS for 
early glottic carcinoma. In our series of 6 patients, 
significant voice improvement could be accom- 
plished in every case, and was documented by ob- 
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jective, as well as subjective, voice assessment. No 
adverse effects related to the surgery were observed. 
Patient selection should follow strict guidelines. The 
most important criteria are a recurrence-free follow- 
up interval of at least 12 months, an absence of path- 
ological alterations in the resection area at the time 
of surgery, a significant glottal gap on videostrobos- 
copy, and the expressed desire of the patient to have 
a better voice. As the material of medialization, auto- 


logous cartilage is to be preferred. We believe that 
approaching the paraglottic space from a superior 
direction is safer than excising a window in the thy- 
roid cartilage, because the risk of entering the scarred 
endolarynx is decreased. Long-term results are not 
yet available for this technique. In theory, degradation 
of the implanted cartilage may occur over time. The 
use of heterologous implants in place of autologous 
cartilage remains to be investigated in the future. 
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ELECTRICALLY STIMULATED GLOTTAL OPENING COMBINED WITH 
ADDUCTOR MUSCLE BOTOX BLOCKADE RESTORES BOTH 
VENTILATION AND VOICE IN A PATIENT WITH 
BILATERAL LARYNGEAL PARALYSIS 


DAVID L. ZEALEAR, PHD 
NASHVILLE, TENNESSEE 
MARK S. COUREY, MD 
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The purpose of this study was to determine whether paced electrical stimulation of the posterior cricoarytenoid muscle with an 
implantable device could restore ventilation in a patient with bilateral vocal fold paralysis without disturbing voice. In the first US 
case of a multi-institutional study, this patient was implanted with an Itrel II stimulator (Medtronic, Inc). In monthly postoperative 
sessions over an 18-month period, an effective stimulus paradigm was derived, the magnitude of stimulated vocal fold abduction and 
ventilation was measured, and perceptual judgments of voice quality were made. After identification of optimum parameters, poste- 
rior cricoarytenoid muscle stimulation produced a moderately large vocal fold abduction of 4 mm, but only marginal improvement in 
mouth ventilation, with no change in voice quality. After adductor muscle blockade with botulinum toxin, the patient’s voice im- 
proved with increased phonatory airflow, but ventilation through the passive airway was still inadequate. However, by combining 
these two therapeutic strategies, dynamic abduction increased to 7 mm, ventilation through the mouth surpassed that through the 
tracheotomy (allowing decannulation), and voice quality was restored to normal. 


KEY WORDS — bilateral vocal fold paralysis, botulinum toxin, electrical stimulation, laryngeal pacing, posterior cricoarytenoid 


muscle, synkinesis, ventilation, voice. 


INTRODUCTION 


Bilateral laryngeal paralysis is a complicated and 
often life-threatening clinical problem. Airway en- 
largement procedures for bilateral vocal fold paraly- 
sis (BVFP), such as cordotomy and arytenoidectomy, 
permanently impair voice and increase the risk of 
aspiration. An innovative strategy for restoring venti- 
lation without affecting voice or swallowing has been 
termed laryngeal pacing. It involves functional elec- 
trical stimulation of the posterior cricoarytenoid 
(PCA) muscle to abduct the vocal folds during inspi- 
ration. During noninspiratory phases, the vocal folds 
passively relax to the midline to allow for normal voic- 
ing and airway protection. 


Basic issues of laryngeal pacing have been ad- 
dressed in both acute and chronic animal models over 
the past 2 decades.!-8 These studies revealed efficient 
electrode designs, practical implantation approaches, 
and optimal stimulus paradigms for long-term stimu- 
lation of laryngeal muscle. Subsequently, the effi- 
cacy of laryngeal pacing was directly demonstrated 
in the human with an external pacing circuit. Percu- 
taneous needle electrodes were used to deliver stimuli 


to the abductor muscles.? More recently, preliminary 
clinical trials with an implantable laryngeal stimula- 
tor have been conducted in Europe and the United 
States!.!! (also Zealear et al, unpublished observa- 
tions). The results of this study have demonstrated 
the merit of laryngeal pacing as a new treatment mo- 
dality for BVFP. Of 6 patients implanted success- 
fully, 5 showed stimulated abduction and return of 
ventilation sufficient for decannulation. Three were 
subsequently decannulated. Further, there were no 
deleterious effects of laryngeal pacing on voice over 
more than 12 months of study. The present report 
chronicles our experience with the first patient im- 
plant in the United States, and describes the prob- 
lems encountered and solutions employed through- 
out the course of investigation. 


METHODS 


This study was approved by the Vanderbilt Insti- 
tutional Review Board, Committee for the Protec- 
tion of Human Subjects, and informed consent was 
obtained from the patient. 


The patient was a 63-year-old woman with an 18- 
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month history of BVFP following thyroidectomy. She 
underwent tracheotomy 2 months after the operation. 
At the initial visit to our institution, she was unable 
to tolerate closure of the tracheotomy tube for more 
than a few seconds. 


The general innervation status of laryngeal mus- 
cles was assessed by means of needle electromyog- 
raphy. The responsivity of the PCA muscle to stimu- 
lation was evaluated with a percutaneous needle and 


` an external pacing circuit. The Itrel II device was 


then implanted. An incision was made in the neck, 
and the electrode was inserted into a subperichondrial 
pocket between the PCA muscle and its cartilage. 
The electrode lead was brought subcutaneously to a 
second incision below the clavicle, where the implant- 
able pulse generator (IPG) was positioned. After im- 
plantation, the IPG could be programmed through 


- the skin by an external transmitter in order to change 


stimulus parameters. In monthly postoperative ses- 
sions, an effective stimulus paradigm was derived, 
the magnitude of stimulated vocal fold abduction and 
associated ventilation was measured, and the effect 
of stimulation. on voice production was assessed. The 
primary goal of the 18-month investigation was to 
determine whether the device could restore ventila- 
tion through the mouth equal to that through the tra- 
cheostoma without disturbing voice. If so, mouth 
ventilation would be deemed sufficient to permit re- 
moval of the tracheotomy tube and closure of the 
tracheostoma (ie, decannulation). 


During each visit, a rigid 70° endoscope was posi- 
tioned in the oral cavity to directly visualize the vo- 
cal folds. The effectiveness of different stimulus pa- 
rameters in producing vocal fold displacement and 
the magnitude of displacement were investigated. 
Two trained observers made independent measure- 
ments of vocal fold abduction from video still frames, 
and the average value was tabulated. The distance 
between the vocal folds was measured first at rest 
and then at peak abduction during stimulation of the 
PCA muscle. Vocal fold width was used as the refer- 
ence for measurement. The width was found to be 3 
mm in sessions conducted with a laser ruler attached 
to the end of the endoscope shaft.!? The laser diodes 
housed in the ruler emitted 2 parallel 635-nm beams 
of 3 mW into the observation field. The beams pro- 
duced 2 small spots on the vocal fold of fixed inter- 
spot distance (10 mm) independent of endoscope 
magnification. 


Ventilatory measurements were made with a Sen- 
sorMedics model 2200 computerized spirometer. 
Calibration of the system was maintained between 
trials by an autocalibration feature, and checked 
manually with a 3-L calibration syringe on a daily 


basis. All tests were performed by a trained techni- 
cian in the Pulmonary Function Laboratory of The 
Vanderbilt Clinic. During a test, the patient breathed 
into the spirometer and inspired maximally to total 
lung capacity (TLC). She then performed a maxi- 
mum forced expiration to residual volume (RV) fol- 
lowed by a maximum forced inspiration back to TLC. 
The standard battery of ventilatory parameters were 
obtained to index vital capacity (TLC — RV) and air- 
flow rate during inspiration and expiration. However, 
measurements of volume and airflow during the in- 
spiratory phase were the primary interest of this study 
(ie, functional inspiratory vital capacity [FIVC], and 
peak inspiratory flow [PIF]). The PIF was the most 
critical measure, since it varied (inversely) with the 
level of airway resistance during inspiration and re- 
flected the extent of stimulated vocal fold abduction. 
The magnitude of PIF governed the level of meta- 
bolic activity expenditure and determined the degree 
of patient activity that could be tolerated. With the 
mouth and nose occluded, the PIF through the tra- 
cheotomy site was obtained as a reference of the op- 
timum level of patient ventilation. The tracheotomy 
and nose were then occluded, and PIF was measured 
through the mouth with the device on. Approxima- 


. tion of values to those of the tracheotomy site would 


determine whether laryngeal pacing could restore 
ventilation through the mouth. Finally, PIF was mea- 
sured through the mouth with the device off, with 
the tracheostoma site and nose still occluded, to de- 
termine whether ventilation through a passive air- 
way was at a level necessary to sustain life in the 
event of device failure. In order to minimize vari- 
ability of data due to changes in patient effort, as 
many as 5 trials were run for each task, so that the 
intertrial variation for at least 3 was less than 10%. 
The average value of the 3 trials was calculated, and 
the data were displayed graphically. 


Voice evaluation was performed before surgery 
and at the monthly postoperative visits. The laryn- 
geal pacemaker was turned off for the testing ses- 
sions in order to compare data to preoperative val- 
ues. The GRBAS scale (grade, roughness, breathi- 
ness, asthenicity, and strain) was used to assess per- 
ceptual changes in voice quality.!3 A number from 0 
(none) to 3 (severe) was assigned to the GRBAS de- 
scriptors of quality. Measures of mean phonatory flow 
rate in reading a 10-second portion of the Rainbow 
Passage were obtained with the Nagashima Phona- 
tory Function Analyzer. Three trials were averaged, 
and the data were displayed graphically. 


RESULTS 


As is often observed in patients with BVFP after 
thyroidectomy, all laryngeal muscles had some rein- 
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nervation, as shown electromyographically. How- 
ever, the reinnervation was faulty, as indicated by 
the inappropriate firing patterns of motor units. For 
example, inspiratory motor unit activity was recorded 
from the thyroarytenoid (TA) muscle that was char- 
acteristic of PCA motor units. In particular, TA in- 
spiratory unit activity increased with a deep inspira- 
tion or sniff, as shown in Fig 1A. Typically, TA in- 
spiratory activity detected during quiet breathing!* 
is repressed or is absent during these inspiratory 
tasks!> (Fig 1B). Both the inadequacy and the synki- 
netic nature of reinnervation likely accounted for the 
vocal fold immobility in this patient. Stimulation of 
the left PCA muscle with an external pacing circuit 
produced vocal fold abduction of 4 to 5 mm in syn- 
chrony with inspiration. In view of these observa- 
tions, the patient was considered a good candidate 
for a laryngeal pacemaker. 


In postoperative sessions, it was determined that 
a 2.5-second train of 1-ms pulses delivered at a fre- 
quency of 40 Hz and an amplitude of 3 to 6 V effec- 
tively produced a dynamic airway. Although previ- 
ous animal studies found 2 ms of duration as the op- 
timum pulse width for recruiting both reinnervated 
and nonreinnervated muscle fibers, the maximum 
pulse width that the Itrel II could deliver was 1 ms.8 
A frequency of 40 Hz generated a fused, tetanizing 
muscle contraction and a smooth vocal fold abduc- 
tion. Two and a half seconds of stimulated abduc- 
tion allowed sufficient time for inhalation with each 
breath. The device was set to deliver 10 stimulus 
trains every minute to match the patient’s respira- 
‘tory rate at a moderate level of activity. (The device 
in its present form is not synchronized with respira- 
tion.) The ideal stimulus amplitude was one that 
evoked maximum vocal fold opening without induc- 
ing discomfort or nociception. During stimulated ab- 
duction, the patient could feel the increased flow of 


0.2 mV 
200 msec Fig 1. Electromyographic record- 
ings from thyroarytenoid (TA) 
ii ~ muscle show motor unit activity 
Inspiration A) during both phonation and in- 


spiration (ie, sniff) in patient with 
bilateral vocal fold paralysis, but 
B) only during phonation in nor- 
mal subject. 


air, which provided a cue to inspire. The patient ap- 
peared to spontaneously entrain her breathing cycle 
with that of the stimulation. 


Figure 2 is a histogram of the static and stimulated 
airway measurements made in postoperative ses- 
sions. Over the first few sessions, the initial stimu- 
lated vocal fold opening of 2.5 mm gradually in- 
creased until the optimal stimulus paradigm was iden- 
tified. By 3 months, the stimulated abduction was 
measured at 4.0 mm, which was significantly larger 
than the 0.5-mm static airway. At 6.5 months, the 
loss in abduction was attributed to a slight shift in 
electrode position less favorable for PCA activation. 
It was hypothesized that the shift could have caused 
greater recruitment of nearby nerves supplying ad- 
ductor muscles, which antagonized glottal opening. 
In order to nullify the antagonistic response, a short- 
acting neuromuscular blocking agent, pancuronium 
bromide, was injected into the ipsilateral TA muscle 
at 7.5 months. Within 10 minutes, dynamic abduc- 
tion was restored to a level greater than that observed 
in previous sessions. The success with pancuronium 
bromide provided a rationale for injection of botuli- 
num toxin (Botox). As shown in subsequent sessions, 
the stimulated abduction following Botox treatment 
was maintained at a higher level. Repeated injections 
of Botox over the next several months not only maxi- 
mized the dynamic airway (7.0 mm), but directly en- 
larged the passive airway (2.5 mm). 


The device malfunction at 10 months was due to 
loss of the electrode channel that served as the an- 
ode. Since the electrode had only 2 channels, the 
stainless steel case of the IPG was employed as the 
anode, and the highest level of abduction was re- 
gained. 


Figure 3 summarizes the ventilation data, graphi- 
cally illustrating the flow rate of inspired ‘air mea- 
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sured at each session. Airflow through the mouth with 
the device on (filled circles) was always greater than 
that with the device off (filled squares). This finding 
was not unexpected, since the device was observed 
endoscopically to produce vocal fold abduction. How- 
ever, before Botox administration, mouth breathing 
with the device on was dramatically less than trache- 
otomy breathing (open circles). Air exchange was so 
limited that the patient could not tolerate tracheosto- 
ma closure for more than a few minutes. This inade- 
quacy in ventilation was surprising and appeared in- 
consistent with the relatively large glottal opening 
viewed endoscopically during stimulation. The ex- 
planation lay in the fact that the patient’s glottis in 
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Fig 3. Graph of peak inspiratory airflow (PIF) over course 
of study. PIF was measured through tracheotomy site 
(open circles), and through mouth with device on (filled 
circles) and with device off (filled squares). “B” indi- 
cates each session in which patient received Botox in- 
jection. Normative value of PIF is indicated (arrow). 
Decannulation was performed at 18 months. 
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the “off ” phase was so constricted that air exchange 
during expiration was severely compromised. After 
Botox therapy, the larger static airway increased the 
volume of air exchange (ie, vital capacity) during 
both inspiration and expiration with the device ei- 
ther on or off. The volume of air that could be passed 
through the mouth on inspiration, FIVC, equaled that 
which could be passed through the tracheostoma. For 
the first time since her tracheotomy, the patient could 
sit quietly with her tracheostoma closed for many 
hours. Furthermore, when the device was activated, 
the patient could engage in more demanding levels 
of activity, such as standing or walking. As shown in 
Fig 3, the airflow during stimulated abduction was 
sufficient to support this higher level of activity, while 
airflow through the passive glottis was not (11 to 24 
months). At 17 months, the PIF through the mouth 
with the device on was measured at 1.82 L/s, surpas- 
sing that through the tracheotomy site (ie, average 
1.77 L/s). Over the following month, the patient kept 
the tracheostoma closed 24 hours a day and relied 
exclusively upon mouth ventilation; she was decan- 
nulated at the end of the month. Since decannulation, 
PIF values through the mouth with the device on have 
ranged from 1.6 to 1.9 L/s. 


Figure 4 is a series of endoscopic views of the 
passive and stimulated airway before (Fig 4A,B) and 
after (Fig 4C,D) Botox injection. Botox administra- 
tion led to an enlargement of the static airway. Stimu- 
lated abduction of the vocal folds from this larger 
static position yielded the greatest glottal opening 
for ventilation (Fig 4D). 

The quality of the patient’s voice was evaluated at 
each follow-up visit by a senior speech-language 
pathologist. In Fig 5, the overall grade assigned at 
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each session is shown in graphical format. Before 
implantation, the patient’s voice was hyperfunctional 
and rated as moderately rough and strained, with a 
score of 2.0. Her voice did not change after the im- 
plant surgery, nor did it change after 6 months of 
electrical pacing. However, upon unilateral Botox in- 
jections in the TA muscle at 7.5 and 8 months, her 
voice became less strained and rough, and her voice 
quality improved to a grade of 1.0. With subsequent 
Botox injections into both vocal folds, the passive 
glottis enlarged and the patient’s voice quality be- 
came normal, with a grade of 0. 


Figure 6 shows the phonatory airflow rate data ob- 
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Fig 5. Voice quality assessed at each patient visit is pre- 
sented as overall GRBAS grade and plotted as function 
of time. 


Fig 4. Endoscopic view of vocal 
folds A,C) at rest and B,D) at peak 
stimulated abduction. A,B) Be- 
fore Botox injection. C,D) After 
Botox injection. Distance between 
vocal folds was measured at rest 
and at peak abduction during 
stimulation of posterior cricoary- 
tenoid muscle. Measurements 
were taken across widest portion 
of glottal gap as illustrated by 
pairs of arrows. 


tained from the patient over the course of study. The 
airflow rates in speech were essentially unchanged 
from preoperative values with laryngeal pacing dur- 
ing the first 6 months. However, after Botox injec- 
tions into the TA muscle began at 7.5 months, there 
was a remarkable increase in airflow during speech. 
The rapid ventilatory exchange during propositional 
speech quadrupled from 33 to 119 mL/s after bilat- 
eral Botox injections, and approached more normal 
flow rates (140 to 200 mL/s).!¢ In reference to Fig 2, 
the increased phonatory airflow undoubtedly resulted 
from the increase in static airway with repeated Botox 
injections into the TA muscle. Further detail regard- 
ing the effect of chronic PCA stimulation on percep- 
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Fig 6. Mean phonatory flow rates obtained during read- 
ing of Rainbow Passage are plotted as function of post- 
operative time. 
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tual, aerodynamic, and acoustic parameters of voice 
is presented in a companion paper.!7 


DISCUSSION 


The results of this study demonstrate the promise 
of laryngeal pacing, even in the most severe case of 
airway compromise from paralysis, if combined with 
Botox therapy. In this case, the patient’s glottis was 
so constricted after synkinetic reinnervation that she 
could not expire inspired air permitted by PCA stimu- 
lation. Her PIF value measured systematically dur- 
ing the second inspiration in a test was only 0.2 L/s, 
despite a 4-mm stimulated glottal opening. This pa- 
tient was distinguished from the other 4 successes in 
multi-institutional studies!®!! (Zealear et al, unpub- 
lished observations) in that laryngeal pacing alone 
could not restore ventilation because of her severely 
constricted airway. However, once the passive air- 
way had been enlarged with Botox to permit expira- 
tion, ventilation through the mouth could be recov- 
ered by PCA stimulation. A stimulated glottal open- 
ing of 7 mm and an inspiratory airflow of 1.8 L/s 
allowed the patient to engage in normal everyday 
activity, sufficient for her eventual decannulation. 
The fact that chronic electrical pacing caused no dis- 
turbance of voice further validated the physiologic 
nature of this treatment. The improvement in venti- 
lation with null effect on voice attests to the signifi- 
cant advantage of laryngeal pacing over conventional 
surgical therapies for BVFP. 


A surprising discovery in this study was the bene- 
fit of Botox as an adjunct therapy to laryngeal pac- 
ing. Botox injection into the TA muscle enhanced 
stimulated abduction and enlarged the passive glot- 
tis. This patient was shown by electromyograph to 
have a synkinetic larynx, in which an imbalance in 
reinnervation favoring adductor over-abductor mus- 
cles was apparent. During stimulation, Botox blocked 
antagonistic activity in the adductors arising from 
inadvertent activation of the nerve fibers reinnervat- 
ing these muscles; this blocking enhanced the abduc- 
tory response. Botox also diminished the passive to- 
nus in the adductors, presenting a less constricted 
glottis for expiration and voicing. The increased air- 
flow during voice production eliminated any strain 


and returned the voice to normal quality. The in- 


creased airflow during ventilation sufficiently sup- 





ported a sedentary lifestyle, but did not permit more 
demanding levels of activity. Only when laryngeal 
pacing was superimposed on an adequate passive air- 
way was ventilation completely reestablished. 


Although useful, these approaches in their present 
form are not without limitations. Neuromuscular 
blockade of synkinetically reinnervated adductors 
may be beneficial in some patients with BVFP. How- 
ever, the nonpermanence of a Botox effect would dic- 
tate that repeated injections be made every few 
months. This study also revealed some deficiencies 
in the pacemaker device. The electrode Jacked a suf- 
ficient number of channels to ensure adequate redun- 
dancy in the face of a shift in electrode position or 
loss of a channel, both of which were encountered in 
this study. In the future, a new electrode system with 
8 channels for bilateral stimulation will be available. 
The electrode will be composed entirely of platinum- 


_ iridium, which is resistant to electrochemical corro- 


sion — an improvement over the type used in the 
present study. Another limitation of the present tech- 
nology was the lack of a sensor to pace stimulation 
with inspiratory effort. Although the stimulus rate 
was set high enough to accommodate moderate lev- 
els of activity in this patient, the rate did not auto- 
matically adjust to her ventilatory need. Remarkably, 
the patient appeared to entrain her respiratory rhythm 
to the device by using sensation of the stimulus or 
proprioception of glottal airflow. The eventual addi- 
tion of an inspiratory sensor will further refine the 
laryngeal pacemaker. 


From this case study, it can be concluded that la- 
ryngeal pacing in combination with Botox therapy 
has the capability to restore both mouth ventilation 
and voice quality in case of severe airway compro- 
mise. Since the patient was decannulated in March 
1998, she has enjoyed a normal lifestyle without any 
change in her voice or ventilatory status. If the effi- 
cacy of laryngeal pacing is further substantiated in 
subsequent patient trials, it could replace convention- 
al therapies for BVFP. However, the risk-to-benefit 
ratio would need to be considered before this treat- 
ment approach could be accepted into mainstream 
clinical practice, In general, the demonstrated suc- 
cess of this technology for laryngeal paralysis could 
provide a basis for its application to the management 
of other paralyzed head and neck muscles. 
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A human middle ear epithelial cell line (HMEEC-1) was established using human papillomavirus E6/E7 genes. HMEEC-1 has 
remained morphologically and phenotypically stable, even after 50 passages. The cells are anchorage-dependent and nontumorigenic 
when injected into nude mice. This cell line thus provides a new tool for the study of normal cell biology and the pathological 
processes associated with the epithelial cells of the middle ear in otitis media. HMEEC-1 will also be useful in the search for new 


drugs and biological agents for the treatment of otitis media. 


KEY WORDS — human middle ear epithelial cell, human papillomavirus type 16, immortalization. 


INTRODUCTION 


Even with the availability of a variety of antimi- 
crobial agents, otitis media remains one of the major 
causes of morbidity for children. Studies ón.the mo- 
lecular mechanisms of the pathogenesis of ‘human 
otitis media have been hampered by the unavailabil- 
ity of a human middle ear cell line. Although pri- 
mary cultures of middle ear epithelial cells have been 
derived from rodent! and humant middle ear tis- 
sue, the only middle ear epithelial cell lines avail- 
able to date are from gerbil,> rat, and chinchilla.” 


The aim of this study was to establish a human 
middle ear epithelial cell line in order to investigate 
the normal biology of the middle ear epithelium, as 
well as to enhance our understanding of the-molecu- 
lar pathogenesis of otitis media. Toward-this objec- 
tive, we transformed primary human middle ear epi- 
thelial cells using the E6 and E7 genes of human 
papillomavirus (HPV) type 16. The E6 and E7 open 
reading frames (ORFs) of this virus were chosen be- 
cause they are effective transforming sequences and 
have been successfully:used to immortalize. many 
types of epithelial cells.8-!3 Furthermore, the.cell lines 
immortalized by HPV have been shown to be nontu- 
morigenic, and have a nondifferentiated morphologi- 
cal phenotype. !4-23 


Human papillomavirus is choi to aka he tasi 


cell layers and replicate during epithelial cell differ- 
entiation. Only those cells with integrated copies of 
HPV DNA become permanent lines — a finding sug- 
gesting that genetic alterations caused by viral DNA 
integration and expression are necessary for continu- 
ous growth.2427 The HPV genomes in cervical car- 
cinomas and in derived cell lines are transcription- 
ally active, and the patterns of viral messenger RNA 
species are specific, with regular expression of the 
E6 and E7 ORFs.?8.29 


The E6 and E7 genes of HPV type 16 function to 
dysregulate cell growth.8-!3 The E7 protein is func- 
tionally and structurally related to the adenovirus ELA 
protein (AdE1A) and the T antigens of other papo- 
vaviruses.30-32 The E7 protein is believed to act, in 
part, by forming a complex with the product of the 
retinoblastoma tumor suppressor gene (p105-RB). 
The formation of the complex with p105-RB is me- 
diated through sequences that are conserved between 
E7, AdEIA, and the simian virus 40 large T antigen 
(SV40 T).33-38 The transforming potential of the E6 

» gene, on the other hand, has been less well defined. 
In NIH 3T3 fibroblasts, it may contribute to charac- 
teristics of the transformed phenotype such as anchor- 
age independence or tumorigenicity in nude mice. 
In human cells, E6 appears to cooperate with the E7 
oncoprotein in mediating cellular immortalization. 
It has been demonstrated that E6 binds to, and medi- 
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ates the degradation of, the cellular tumor suppres- 
sor protein p53.39-45 


We report here that using a retroviral vector to de- 
liver the E6 and E7 genes of HPV type 16, we have 
been able to successfully establish a human middle 
ear epithelial cell line. The cell line, named HEI- 
HMEEC-1 (HMEEC-1), retains many of the pheno- 
typic and morphological properties of the nontrans- 
formed, parental cultures and will be a useful tool 
for further elucidation of the molecular mechanisms 
involved in the pathogenesis of otitis media. To our 
knowledge, this is the first report of the successful 
establishment of an immortalized human middle ear 
cell line. 


MATERIALS AND METHODS 


Primary Culture. Primary cultures of normal hu- 
man middle ear epithelial cells (NHMECs) were es- 
tablished from explants of fresh human middle ear 
epithelium obtained from patients undergoing neuro- 
otologic surgery for acoustic neuromas. Pieces of 
middle ear mucosa, measuring 3 x 3 mm, were re- 
moved from the region near the tubotympanum. Af- 
ter washing twice in sterile magnesium- and calcium- 
free phosphate-buffered saline solution (PBS: 1x, pH 
7.4), each piece of mucosa was cut with a scalpel into 
three 1-mm fragments. The fragments were then 
placed into 35-mm tissue culture dishes that had been 
precoated with type I rat tail collagen (Collaborative 
Biochemical Products, Bedford, Massachusetts). All 
cells were incubated at 37°C in a humidified atmo- 
sphere of 5% CO2 and 95% air. The fragments were 
grown in bronchial/tracheal epithelial cell growth 
medium (BEGM; Clonetics, Walkersville, Maryland) 
containing penicillin (100 U/mL) and streptomycin 
(100 ug/mL). The BEGM was made per the manu- 
facturer’s instructions by supplementing a 500-mL 
bottle of bronchial/tracheal epithelial cell basal me- 
dium (BEBM) with the following: 13 mg/mL bovine 
pituitary extract, 2 mL; 0.5 mg/mL hydrocortisone, 
0.5 mL; 0.5 mg/mL human recombinant epidermal 
growth factor, 0.5 mL; 0.5 mg/mL epinephrine, 0.5 
mL; 10 mg/mL transferrin, 0.5 mL; 5 mg/mL insu- 
lin, 0.5 mL; 0.1 mg/mL retinoic acid, 0.5 mL; 6.5 
mg/mL triiodothyronine, 0.5 mL; and 50 mg/mL gen- 
tamicin, 50 mg/mL amphotericin B, 0.5 mL. After 2 
to 14 days, NHMECs began to migrate out from the 
explants and formed a monolayer. The few contami- 
nating fibroblasts were physically removed with a 
surgical scalpel under the microscope. By this tech- 
nique, primary cultures were successfully established 
from 5 of 7 donors. For passaging, the cells were dis- 
sociated with Hank’s balanced salt solution (HBSS) 
containing 0.025% trypsin—0.01% ethylenediamine- 
tetraacetic acid (EDTA; Bio-Whittaker, San Diego, 


California) and split at a 1:4 ratio. The culture me- 
dium was replaced twice per week. 


Immortalization. The PAS 17 amphotropic pack- 
aging cell line, stably transfected with a replication- 
defective retrovirus construct, pLXSN16E6-E7, cod- 
ing for HPV type 16 transforming oncoproteins E6 
and E7 (provided kindly by D. Galloway, Fred Hutch- 
inson Cancer Research Center, Seattle, Washing- 
ton),4© was grown to 70% to 80% confluence, and 
supernatants were collected after 24 hours and stored 
in aliquots at -80°C. Human middle ear epithelial 
cell cultures (passage 3; at 50% to 70% confluence) 
were infected with 1 mL of virus stock diluted in 9 
mL of fresh BEGM for 48 hours. The virus was then 
removed, and the medium was replaced with fresh 
BEGM. The cells were allowed to proliferate, and at 
70% confluence, they were trypsinized and reseeded 
in fresh BEGM medium containing G418 (Gibco- 
BRL, Gaithersburg, Maryland) at a concentration of 
0.4 mg/mL. The cells were kept in the G418-con- 
taining selection media for 14 days. After the selec- 
tion procedure, the culture medium was changed to 
BEGM without G418 and the cells were expanded. 
At this stage, half of the culture was frozen and the 
other half was used for long-term propagation. To 
propagate the cells, the cultures were split at a ratio 
of 1:4 once per week. 


Cell Freezing. Pelleted cells were resuspended in 
BEGM containing 10% dimethyl sulfoxide at a con- 
centration of 1 x 106 cells per milliliter and subse- 
quently aliquoted in 2-mL freezing tubes. The tubes 
were placed at —70°C overnight and transferred to 
liquid nitrogen the next day. 


Colony Isolation Procedure. Flasks containing col- 
onies were rinsed twice with HBSS, and the colonies 
were marked from outside the flask. Cloning rings 
dabbed with autoclaved vacuum grease were then 
used to encircle the colonies. To dislodge the cells, 
HBSS containing 0.025% trypsin—0.01% EDTA was 
added to the inside of each cloning ring (enough to 
cover the colony), and the flask was incubated at 
37°C until the cells were dissociated. The cells were 
then counted and transferred to coated T25 flasks 
containing 5 mL BEBM plus hydrocortisone and hu- 
man epidermal growth factor. 


Determination of Cell Growth. To compare the 
growth of the primary cultures and the transformed 
cells, we plated approximately 7 x 10° cells per well 
in a 24-well culture plate. The cells were counted 
every 2 days, and the population doubling time (DT) 
was calculated with the following formula: DT = 
T/x, where T = total time elapsed and x = number of 
generations. LogN = LogNo + xLog2, where N = 
final number of cells and No = initial number of cells. 
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Growth Factor Response. To measure the effect 
of various media on growth parameters, we cultured 
cells for at least 2 passages in basal medium and con- 
ditioned medium supplemented with various growth 
factors (see Primary Culture, above). Approximately 
2.5 x 104 cells per well were plated in a 6-well cul- 
ture plate. The number of cells per well was deter- 
mined every other day for 14 days. The population 
doubling times were calculated as described above. 


Morphological Studies. For light microscopy, im- 
mortalized middle ear epithelial cells were cultured 
on coverslips placed inside collagen-coated petri 
dishes. The cells were visualized and photographed 
with an inverted phase contrast microscope (Axiovert 
135, Zeiss, New York, New York). 


Immunocytochemistry. The cultured cells were 
fixed with a chilled 1:1 mixture of methanol and ace- 
tone for 5 minutes, and pretreated with 0.3% hydro- 
gen peroxide for 20 minutes at room temperature in 
order to remove the endogenous peroxidase activ- 
ity. They were treated with 1:600 normal rabbit se- 
rum for 20 minutes to block nonspecific reactions 
and then were incubated with monoclonal mouse an- 
tibodies to human pan-cytokeratin (1:100, DAKO, 
Carpinteria, California), vimentin (1:50, DAKO), and 
von Willebrand factor (1:50, DAKO) at 37°C for 2 
hours. They were then incubated with biotinylated 
anti-mouse rabbit IgG (1:200, Vector Laboratories, 
Burlingame, California) at 37°C for 30 minutes. Per- 
oxidase was attached to secondary antibody by avi- 
din biotin peroxidase complex formation. Specimens 


were incubated in diaminobenzidine tetrahydrochlo- . 


ride (DAB) to detect bounding sites of primary anti- 
body. Counterstaining was performed with Mayer’s 
hematoxylin. The same procedures were performed 
with use of nonimmunized mouse IgG (Sigma, St 
Louis, Missouri) instead of primary antibody for a 
negative control. 


Anti-ZO-I labeling was performed on cell mono- 
layers that had been fixed in 4% buffered formalde- 
hyde and permeabilized with Triton X-100. Polyclo- 
nal rabbit anti-human recombinant ZO-1 (Zymed, 
San Francisco, California) was applied to permeabil- 
ized and unpermeabilized monolayers and visualized 
with fluorescent secondary antibodies. Labeled prepa- 
rations were examined and photographed under indi- 
rect fluorescent illumination (Axiophot 2, Zeiss). 


Electron Microscopy. Cells were grown on plastic 
coverslips (Nunc, Naperville, Illinois) or on semiper- 
meable membranes (Costar Corp, Cambridge, Mas- 
sachusetts). In either case, the cells were fixed with 
2.5% glutaraldehyde in 100 mmol/L sodium cacody- 
late buffer (pH 7.2) for 2 hours at room temperature. 


After washing in cacodylate buffer, the cells were 
postfixed in 1% osmium tetroxide for 1 hour and de- 
hydrated in ethanol. The specimens were then em- 
bedded in epoxy resin (Eponate 12, Ted Pella, Red- 
mond, California). For the cells on membrane filters, 
the filters were cut into small pieces and embedded 
in flat silicone rubber molds. The cells on the cover- 
slips were inverted, cell-side-down, on silicone rub- 
ber molds filled with freshly mixed resin. The epoxy 
resin was cured at 60°C for 24 hours, and the plas- 
tic coverslip was removed from the resin block be- 
fore immersion in liquid nitrogen. Ultrathin sections 
were contrasted with uranyl acetate and lead citrate 
and visualized by transmission electron microscopy . 
(CM 120 BioTwin, FEI-Philips, Portland, Oregon). 


Soft Agar Assay for Anchorage Independence. The 
soft agar used in the anchorage independence assay 
was prepared with Noble agar (Difco, Detroit, Mich- 
igan) in BEGM media. The base agar consisted of 
0.9% agar, whereas the top agar contained 0.36% 
agar. For the assay, 2 mL of base agar was dispensed 
into 35-mm (2-mm grid) dishes and allowed to cool. 
The cell suspension containing 1 x 104 cells per mil- 
liliter in 5 mL of 0.36% top agar was then added to 
the plates. The plates were checked after 21 days, 
and any colony larger than 0.125 mm in diameter was 
counted. 


Karyotype Analysis. Acomplete karyotype analy- 
sis was carried out by Drs J. Kaplan and B. Hukku 
of the Children’s Hospital of Michigan, Detroit. 


Tumorigenicity in Nude Mice. To determine their 
tumorigenicity, we resuspended the transformed cells 
at a concentration of 5 x 107 cells per milliliter in 0.2 
mL of PBS. The cells were then injected subcutane- 
ously into the backs of 5-week-old BALB/c nu/nu 
athymic mice. Injection of 0.2 mL of PBS, without 
cells, served as the control. For a period of 6 months, 
the mice were examined weekly for evidence of tu- 
mor formation. 


Detection of HPV E6/E7 Genes by RT-PCR. Ex- 
pression of E6/E7 genes was confirmed by means of 
reverse transcription—polymerase chain reaction (RT- 
PCR). Total RNAs were extracted from primary cul- 
ture of NHMECs and HMEEC-1 at 3 different pas- 
sages by use of the RNeasy mini kit (Qiagen, Santa 
Clarita, California). One microgram of the total RNA 
was heated at 70°C for 10 minutes, quick-chilled on 
ice for at least 1 minute, and then added to the re- 
verse transcription reaction. Each reaction contained 
5 uL of 5x first strand buffer (250 mmol/L Tris—hy- 
drochloric acid [pH 8.3], 375 mmol/L potassium 
chloride, 15 mmol/L magnesium chloride), 1 mmol/ 
L dithiothreitol, 50 ng random hexanucleotide prim- 
ers, 10 mmol/L of each of the 4 deoxynucleoside tri- 
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Fig 1. Phase contrast light micrographs of cultured human middle ear epithelial cells after 4 days in culture. Explant and 
outgrowth can be clearly observed. A) Original x100. B) Original x400. 


phosphates (dNTPs), and 200 U Superscript II re- 
verse transcriptase (Life Technologies, Grand Island, 
New York) in a total volume of 20 uL. The reaction 
mixtures were preincubated at room temperature for 
10 minutes. The samples were then incubated at 42°C 
for 50 minutes before heating to 70°C for 15 min- 
utes and rapid cooling on ice. The first strand comple- 
mentary DNAs (cDNAs) were then amplified by 
PCR. The primers chosen for amplification of E6 
were as follows: 5' primer: 5'-GCAAGCAACAGT- 
TACTGCGACGT-3’; 3’ primer: 5'-GCAACAAGA- 
CATACATCGACCGG-3'. The predicted product size 
with these primers was 323 base pairs. The primers 
chosen for amplification of E7 were as follows: 5' 
primer: 5'-GATGGTCCAGCTGGACAAGC-3'; 3' 
primer: 5'-GTGCCCATTAACAGGTCTTC-3’. 
The predicted product size with these primers was 
143 base pairs. The PCR reaction mixtures contained 
5 uL of 10x PCR buffer (Qiagen, Valencia, Califor- 
nia), 10 mmol/L of each of the 4 dNTPs, 1.5 mmol/ 
L magnesium chloride, 10 umol/L of each primer, 
2.5 U Taq polymerase I (Qiagen), and diethyl pyro- 
carbonate-treated distilled water in a total volume 
of 50 uL. After an initial denaturation for 5 minutes 


at 94°C, 35 cycles of PCR were carried out in the 
Thermal Cycler 9600 (Perkin-Elmer Corp, Norwalk, 
Connecticut), with each cycle consisting of denatur- 
ation at 94°C for 1 minute, annealing at 60°C for 45 
seconds, and extension at 72°C for | minute. The du- 
ration of the final elongation reaction was increased 
to 7 minutes at 72°C to permit completion of reac- 
tion products. The amplified gene-specific PCR prod- 
ucts were electrophoresed ina 1.2% agarose gel con- 
taining 0.1 ug/mL ethidium bromide. Each PCR prod- 
uct was sequenced with the 310 Gentetic Analyzer 
(ABI PRISM, Mountain View, California). 


RESULTS 


Primary Cell Culture. Migration out of the explant 
was usually observed within 3 to 5 days after the 
initiation of the culture (Fig 1). The primary culture 
cells were uniform in shape and arranged in a typi- 
cal monolayer pattern. Some cells showed “dome” 
formation, probably due to fluid collection under- 
neath the cell monolayer (Fig 2). Initially, the cells 
took approximately 7 days to reach confluence, at 
the end of which time the cells were split at a ratio of 
1:2. By passage 4 or 5, the doubling time had slowed 





Fig 2. Phase contrast light micrographs demonstrate “dome formation,” which is due to collection of fluid beneath, in both 
A) primary and B) immortalized cells of human middle ear epithelial cell line (HMEEC-1; original x400). 
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Fig 3. Phase contrast light micro- 
graphs demonstrate response of nor- 
mal primary cells and transduced 
cells after treatment with G418 (orig- 
inal x400). Transduced cells (D-F) 
showed minimal response to G418 
treatment, whereas 90% of normal 
cells (A-C) died within 1 week. A,D) 
Before G418 treatment. B,E) Three 
days after G418 treatment. C,F) Sev- 
en days after G418 treatment. 


down as compared with earlier passages, and cells 
took significantly longer to reach confluence. This 
decrease in proliferative ability was associated with 
changes in the morphology of the cells, including 
enlargement and flattening. The cells continued to 
steadily lose their proliferative capability, and by pas- 
sage 8, they were no longer dividing and began to 
die. No evidence of spontaneous transformation was 
observed in any of the cultures. 


Cell Immortalization. Although the virally infected 
cells did initially appear to be healthy and retain the 
normal morphology of their parental primary cells, 
most of them died within 2 weeks. The surviving 
cells, those expressing the E6 and E7 transforming 
gene products of HPV, began to rapidly proliferate 
and eventually gave rise to visible colonies. These 
cells appeared to be uniform in shape and smaller in 
size than the parental primary cells. Once confluent, 
the cells were harvested and reseeded in medium 
containing G418. NHMECs were cultured with G418 
as controls. The transduced cells showed minimal re- 
sponse to G418 treatment, whereas 90% of normal 
cells died within 4 to 6 days (Fig 3). The transduced 
cells were subcloned by being seeded at a density of 
103 cells per 100-mm culture dish. After 14 days, 
clonal colonies could be seen. Multiple clones were 


E 





isolated, and each was transferred to a 60-mm cul- 
ture dish. The majority of the clones began to lose 
their proliferative capability and underwent terminal 
senescence between passages 15 and 18. However, 
1 clone, designated HMEEC-1, appears to be immor- 
talized and has been passaged more than 50 times 
during the period of 1 year without any evidence of 
a reduced proliferative capacity. This HMEEC-1 
cloned cell line was used for further characterization. 

Growth Characteristics. The average doubling time 
of the HMEEC-1 line was 23.8 hours, compared to 
33.9 hours for primary middle ear epithelial cells 
(NHMECs). The HMEEC-1 line showed better plat- 
ing efficiencies than NHMEC when seeded at low 
cell density (Fig 4A). In terms of growth factor re- 
quirement, the parental primary cells were unable to 
grow in basal medium and required the addition of 
growth factors such as insulin and EGF. The HMEEC- 
| line, on the other hand, grew well in basal medium, 
as well as conditioned medium supplemented with 
various growth factors (Fig 4B). 


Immunohistocytochemistry. All cultured cells of the 
HMEEC -1 line expressed pan-cytokeratin, a marker 
of epithelial cells (Fig 5). Expression of von Wille- 
brand factor VIII (a marker of endothelial cells) or 
vimentin (a marker of fibroblasts) was not detected 
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Fig 4. Graphs show growth rate of immortalized human 
middle ear epithelial cells (HMEEC-1 cells) and primary 
cells. A) HMEEC-1 shows more rapid growth and better 
plating efficiency than primary cells. B) Effect of growth 
factors on growth of primary cells and HMEEC-1. These 
supplements are essential for survival of primary cells, 
but HMEEC-1 can grow well in basal medium (BM), as 
well as conditioned medium (CM) supplemented with 
growth factors. 


in the HMEEC-1 line or in the parental primary cul- 
tures — a finding indicating that the cells were not 
of endothelial cell or fibroblast origin (see Table). ZO- 


CHARACTERISTICS OF HMEEC-1 











NHMECs HMEEC-1 
Life span 5-7 passages >50 passages 
Doubling time 33.9h 23.8h 
Pancytokeratin Positive Positive 
Vimentin Negative Negative 
Factor VIII Negative Negative 
Soft agar growth N/A No growth 
Tumorigenicity N/A Not tumorigenic (0/4) 
Karyotype N/A Human male 
Growth factor High Lower than that of 


requirement NHMECs 


HMEEC- | — human middle ear epithelial cell line, NHMECs — 
normal human middle ear epithelial cells, N/A — not available. 





| immunolabeling showed diffuse cytoplasmic local- 
ization in the HMEEC-1 cell line, instead of a more 
typical hexagonal cell pattern around the cell periph- 
ery (data not shown). 


Electron Microscopy. The HMEEC-1 cells showed 
evidence of epithelial cell characteristics such as mi- 
crovillar formation on the apical cell surface and ex- 
pression of maculae adherens (desmosomes) on baso- 
lateral surfaces (Fig 6). The cytoplasm contained 
electron-opaque, filamentous bundles, interpreted to 
be keratin, which sometimes were associated with 
desmosomes. Zonulae occludens (tight junctions) 
and zonulae adherens (intermediate junctions) were 
not detected. 


Soft Agar Assay for Anchorage Independence. To 
test for anchorage independence, we grew both the 
HMEEC-1 and the parental, untransformed primary 
human middle ear epithelial cells (NHMECs) in soft 
agar. Even after 21 days, no sign of growth was visi- 
ble for either cell type. 


Karyotype Analysis. Karyotype analysis was per- 





with normal mouse serum shows that none of cells are labeled. 
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tron-opaque filamentous bundles that originate from desmosomes and run through cell cytoplasm. Scale bar — 400 nm. B) 
Higher-power view of macula adherens (desmosome) shows details of ultrastructure. Presence of zonula occludens (tight 
junction) is not apparent. Scale bar — 200 nm. 


formed on the HMEEC-1 cells at passage 20. The 
results of the karyotype analysis indicate that the cells 
possess a Y chromosome and have chromosome 
counts in the range of 39 to 47 (hypodiploid). The 
average chromosome number for HMEEC-1 cells 
was 43, and the Y chromosome was present in about 
40% of metaphases examined. Some of the autosomes 
appeared abnormal in appearance, suggesting the 
presence of gross deletions or translocations (Fig 7). 


Tumorigenicity in Nude Mice. The HMEEC-1 cells, 
as well as the parental primary human middle ear 
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Fig 7. Karyotypic analysis of HMEEC- 
1. Immortalized cells are composed of 
normal near-diploid human epithelial cells 
with random chromosomal changes. 


epithelial cells (NHMECs), were tested for tumori- 
genicity in nude (nu/nu) mice. Subcutaneous injec- 
tion of 107 cells into the animals did not result in tu- 
mor formation, even after 6 months, for either cell 
type (data not shown). 


Detection of HPV E6/E7 Genes. Transcripts of E6 
and E7 genes were constitutively expressed in 
HMEEC-| cells at 8, 25, and 40 passages, respec- 
tively, whereas these viral transcripts were not ex- 
pressed in NHMECs (Fig 8). Sequence analysis of 
each amplified cDNA fragment demonstrated a 100% 
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Fig 8. Integration of human papillomavirus 16 E6 and E7 messenger RNAs (mRNAs) in 
HMEEC-] analyzed by 35 cycles of reverse transcription—polymerase chain reaction. Polyme- 
rase chain reaction—amplified DNAs on 1.2% agarose gel electrophoresis. 326 base pair (bp) 
bands represent presence of mRNA for E6 gene, and 143 bp bands represent E7 gene. These 


expressions are independent of passages. Lane A 





primary cultured cells of human middle 


ear epithelium; lane B — HMEEC-1 P-8 (passage 8); lame C — HMEEC-| P-25 (passage 


25); lane D — HMEEC-1 P-40 (passage 40). 


homology with the corresponding regions of each 
gene (data not shown). 


DISCUSSION 


Interaction of bacterial surface molecules with re- 
ceptors on the surface of the middle ear epithelial 
cells is one of the most crucial steps in the develop- 
ment of otitis media, although much of this interac- 
tion is poorly understood.*’#? In order to elucidate 
the cellular and molecular mechanisms involved in 
the pathogenesis of otitis media, it is important to 
understand the normal biology of middle ear epitheli- 
um, and such studies require large numbers of mid- 
dle ear epithelial cells. Although there have been 
several reports on the establishment of primary cul- 
tures and cell lines of middle ear epithelial cells in 
rodents,!- to date there have been no reports of im- 
mortalized human middle ear epithelial cells. This is 
in part due to the fact that in contrast to mouse cells, 
spontaneous immortalization of human cells is a very 
rare event, and immortalizing genes need to be used 
in order to generate immortalized cell lines. Several 
genes have proved to be efficient in immortalizing 
cultured cells. These include, among others, SV40 
T, the papillomavirus E6 and E7 genes, the AdEIA 
gene, oncogenes, and mutants of the p53 gene. 


We have been able to successfully derive a human 
middle ear epithelial cell line, HEI-HMEEC-1 
(HMEEC-1), by using a replication-defective retro- 
virus construct, pLXSN16E6-E7, coding for the HPV 
type 16 transforming oncoproteins E6 and E7.50-52 
The cell lines derived by transformation with HPV 
E6 and E7 genes have been shown to be nontumor- 
igenic,'®!8 and in agreement with these reports, 
HMEEC-| cells were also observed to be nontumor- 
igenic and anchorage-dependent. Furthermore, kar- 
yotype analysis revealed that the cell line is male- 
derived and hypodiploid. Moreover, HMEEC-1 cells 
maintain the capacity of their parental cells to pro- 
duce secretory gene products such as lysozyme and 


mucin (data not shown). Expression of E6 and E7 
genes was detected in the HMEEC-1 cells, but not 
in the parental cells — a finding suggesting that the 
elevated levels of the E6 and E7 gene products are 
responsible for immortalizing the HMEEC-| cells. 





HMEEC-| cells are similar in appearance to pri- 
mary human middle ear epithelial cells; they form 
monolayers and are hexagonal in shape. HMEEC-| 
cells are, however, considerably smaller than the pa- 
rental primary cells. When examined with a light mi- 
croscope, the HMEEC-| cells exhibit “dome” forma- 
tion, a phenomenon frequently seen in primary cul- 
tures of rat and human middle ear epithelial cells that 
is thought to be a result of the accumulation of fluid 
under the cell monolayer. Formation of the dome sug- 
gests that HMEEC-1 cells, like the primary cells, pos- 
sess active ion and fluid transport mechanisms. Fur- 
thermore, HMEEC-1 cells stain positively for cyto- 
keratin. but are negative for vimentin and factor VIII 
— findings suggesting that the cell line is epithelial 
in origin. 

The ultrastructural study of HMEEC-1 cells re- 
vealed the presence of desmosomes, a characteristic 
of epithelial cells, but full development of zonula 
occludens (tight junction) was not observed. The fact 
that ZO-| immunolabeling showed high levels of this 
protein in the cell cytoplasm demonstrates that ZO- 
1 molecules are being synthesized by these cells. That 
the typical hexagonal cell border pattern of ZO-1 
labeling usually seen in polarized epithelial cells was 
not detected may be a result of immortalization’s 
damaging either the mechanism for tight junction for- 
mation or the ZO-1 protein transport system. Simi- 
lar observations have been reported in the MDCK 
(Madin-Darby canine kidney) cell line.°3 


HMEEC- | cells also labeled positive for cytokera- 
tin, supporting our hypothesis that the filamentous 
bundles observed by transmission electron micros- 
copy are keratins. Cytokeratins are intermediate-sized 
filaments containing keratinlike proteins and are 
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found specifically in epithelial cells. Cellular cyto- 
keratin immunostaining patterns differ among cells 
on the basis of species derivation, tissue derivation, or 
presence of pathological states. It has been shown 
that the primary middle ear epithelial cells from the 
chinchilla express several cytokeratin polypeptides, 
including 7, 18, and possibly others.>4 These results 
are similar to ours and further substantiate that 
HMEEC-1 cells are of epithelial origin. Taken to- 
gether, these results suggest that the HMEEC-1 cells 
originated from epithelial cells and have retained 
many characteristics of primary epithelial cells. 


With respect to growth factor requirements, this 
cell line was less dependent on growth factors than 
primary NHMECs. This finding may be a conse- 
quence of the mutation of cellular genes by the bind- 
ing of their protein products to virus-encoded proteins 
or viral DNA.!5 For gene expression studies, synchro- 
nization of HMEEC-1 cells by depriving them of 
growth factors for 16 to 24 hours results in a reduction 
of background. The overall pattern of gene expres- 
sion, however, is not affected by the growth factor 
deprivation (data not shown). In contrast to most 
HPV-immortalized human epithelial cells, HMEEC- 
1 cells are hypodiploid and appear to have fewer gross 
chromosomal abnormalities. In transformed cells, vi- 
ral sequences are often found in the abnormal chro- 
mosomes at junctions of chromosome translocations 
or duplicated chromosome segments. Chromosomes 
1 and 3 are most commonly involved in structur- 
al aberrations. Consistent with this, we have also ob- 





served a deletion in chromosome 3 in our immortal- 
ized cells. Additionally, the cells have abnormalities 
in chromosomes 9, 13, 15, and 18. An increase in 
the number of certain chromosomes may also be crit- 
ical for immortalization, because the increase in dam- 
age of some genes could confer a selective advan- 
tage to the immortalized cells.5> Unlike normal cells, 
immortalized cells readily convert to neoplastic cells 
because of their inability to arrest the cell cycle and 
repair DNA when challenged with genotoxic agents 
such as chemical carcinogens. Our results demon- 
strate that immortalization apparently does not result 
in tumorigenicity, as assessed by soft agar assay and 
nude mouse techniques. we 


In conclusion, the HPV-immortalized human mid- 
dle ear cell line, HMEEC-1, possesses many structur- 
al features and immunocytochemical characteristics 
of the parental primary middle ear epithelial cells. 
The cell line has retained many characteristics of the 
parental primary culture, as evidenced by the ultra- 
structural findings, chromosomal analysis, and cyto- 
keratin staining patterns. The availability of an im- 
mortalized human middle ear epithelial cell line will 
aid in our efforts to isolate and characterize bacterial 
receptors, as well as in those to study the normal bi- 
ology of middle ear epithelial cells. Understanding 
the fundamental biology of middle ear epithelial cells 
will ultimately help us to better understand the patho- 
genesis of otitis media and will facilitate our efforts 
to find effective preventative and therapeutic drugs, 
particularly for otitis media with effusion. 
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Olfactory evoked potential is considered a useful method of electrophysmological olfactometry for the diagnosis of olfactory 
disturbance. However, electrophysiological olfactometry is not as widely used as electrophysiological audiometry, such as the audi- 
tory brain stem response, because odor stimulation is difficult to perform. In contrast, electrical pulse stimulation is easy to perform, 
and its evoked potential is also easily recorded by the averaging method. We recorded olfactory evoked potentials from the scalp 
produced by electrical stimulation on the olfactory mucosa and investigated the relationship between this electrical olfactory evoked 
potential (EOEP) and the results of Toyoda and Takagi’s perfumist’s strip method (T&T) olfactometry, which is a standard Japanese 
means of psychophysical olfactometry. In EOEP-detectable cases, the detection and cognitive thresholds of T&T olfactometry were 
1.32 + 1.99 (mean + SD; n = 10) and 3.02 + 1.64 (n = 10), respectively. But in the undetectable cases, the thresholds were 4.67 + 2.03 
(n = 8; 5.8 means off the scale) and 5.80 + 0.00 (n = 8), respectively. The differences between the T&T thresholds of EOEP- 
detectable and -undetectable cases were significant. We concluded that EOEP `s suitable for electrophysiological olfactometry. 


KEY WORDS — electrical stimulation, evoked potential, olfactory disturbance, T&T olfactometry. 


INTRODUCTION 

The first step in diagnosing olfactory disturbance 
is to accurately identify olfactory ability. Unlike audi- 
ometry, methods of olfactometry are not standardized 
globally. Several methods, such as Toyoda and Taka- 
gi’s perfumist’s strip method (T&T olfactometry; 
Japan),! the University of Pennsylvania Smell Identi- 
fication Test (UPSIT; United States),? and Sniffin’ 
Sticks (Germany),* are used in different parts of the 
world. 

Measurement of the olfactory evoked potential 
(OEP) is considered a useful method of electrophys- 
iological olfactometry for patients with olfactory dis- 
turbance. However, electrophysiological olfactom- 
etry is not as widely used as electrophysiological au- 
diometry, because pulse odor stimulation is not easily 
performed. Pioneers designed special odor delivery 
equipment for stimulation in order to record the 
evoked potential.4© Although some of these suc- 
ceeded in obtaining clinical data for OEP in olfactory 
dysfunction,” these applications were only used for 
research purposes. Positron emission tomography!° 
and functional magnetic resonance imaging!! are also 
useful tools to determine human olfaction, but they 
are difficult to use as common clinical tests because 
of their high cost. 


Electrical stimulation of the olfactory mucosa 
evokes potential change on the scalp. We called this 
the electrical olfactory evoked potential (EOEP).!? 


The EOP is thought of as the potential change of 
the olfactory bulb. It is not a somatosensory potential, 
because of its latency and localization in the frontal 
region of the scalp.!* The short latency of the EOEP, 
about 20 ms, is similar to that of the rabbit EOEP!? 
originating in the olfactory bulb, and indicates the 
impossibility of cortical origin. The EOEP can be 
recorded with a regular evoked potential measure- 
ment system widely used in general hospitals to re- 
cord aucitory brain stem response, electrocochleog- 
raphy, or electromyography data. Therefore, if a rela- 
tionship between EOEP and the olfactory threshold 
can be revealed, it will be a useful method for the 
electropaysiological diagnosis of olfactory distur- 
bance. 


MATERIALS AND METHODS 


Fourteen patients, 10 male and 4 female, who con- 
sulted our department to check their olfaction were 
tested. They were informed about the study and 
signed cocuments of consent following the World 
Medical Association Declaration of Helsinki Rec- 
ommencations guiding physicians in biomedical re- 
search involving human subjects. The age of the pa- 
tients was 53.9 + 14.9 years (mean + SD). 


All subjects except patient 7 underwent standard 
bilateral T&T olfactometry! (see Table). The thresh- 
olds of T&T olfactometry for 5 odorants, B-pheny1 
ethyl alcohol (odorant A), methyl cyclopentenolone 
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PATIENT DATA 
T&T Olfactometry 
Thresholds 
Patient Age Sex Complaint Background Side EOEP Latency(ms) Detection Cognitive 
1 19 M Normal Hypogonadism R + 25 1.8 2.2 
2 46 M Parosmia Head trauma L + 18 3.0 4.2 
3 56 M Anosmia Side effect of tegafur R — 5.8 5.8 
4* 45 F Hyposmia Post common cold R + 24 0.8 3.4 
5 73 F Anosmia Unknown L - 4.0 5.8 
6 6 M Parosmia Unknown L + 20 -1.8 1.4 
7 5 M Hyposmia Olfactory meningioma R + 20 1.4 1.8 
L + 18 3.6 4.2 
8 53 F Anosmia Post common cold E - 5.8 5.8 
oF 65 M Hyposmia Unknown R — 
L - 0.8 5.8 
10 55 M Anosmia Unknown L - 5.8 5.8 
11 64 M Hyposmia Unknown R + 21 
L + 33 0.8 4.2 
12 47 F Anosmia Tubercular sellar meningioma R - 
post operation 
L - 5.8 5.8 
13 344 M Normal Pituitary adenoma L + 25 (30) -0.2 1.2 
14 77 M Hyposmia Unknown L + 20 4.8 5.8 


For electrical olfactory evoked potential (EOEP), plus means detectable and minus means undetectable. 
T&T olfactometry thresholds are bilateral, except in patient 7. In patient 13, latency of late peak of twin peaks is indicated in parentheses. 


*Complained of both olfactory and taste disturbances (hypogeusia). 


+Complained of constant bitter taste in mouth. 


(B), isovaleric acid (C), y-undecalactone (D), and ska- 
tole (E), were averaged. The 8 degrees (—2 to +5) of 
the odorant on T&T olfactometry represent the con- 


centration series 10-2 to 105, and zero represents the — 


average detection threshold of normosmic subjects.! 
When the highest concentration was not responded 
to, the score was regarded as 6, except in the case of 
odorant B, for which the score was regarded as 5. 
Because lateralized olfactory dysfunction was sus- 
pected in patient 7, lateralized T&T olfactometry was 
performed instead of bilateral T&T olfactometry. Bi- 
lateral olfactory thresholds are represented by the su- 
perior side,'4 so the threshold of the superior side 
was substituted for the bilateral threshold to convert 
the lateralized threshold to a bilateral threshold of 
T&T olfactometry. 


The EOEP was recorded from the frontal and later- 
al parts of the scalp (Fig 1). The intensity of stimu- 
lation was 2 mA, and its duration was 0.5 ms. The 
stimulation current was fed to the olfactory mucosa 
of the subject via bipolar electrodes. Three hundred 
trials were averaged. Four channels of EOEPs were 
amplified, filtered (2 to 250 Hz), and recorded. Most 
patients had deviation of the nasal septum. When it 
was difficult to insert the stimulation electrode in both 
olfactory clefts, the wider nasal cavity was chosen 
for stimulation. As the olfactory cleft is narrow, the 
silver sphere that is the tip of the stimulating electrode 
contacts the septal and lateral sides of the olfactory 


mucosa; the silver sphere contacts only one side with 
a wide-opened olfactory cleft. If the contacted surface 
area of the silver sphere is large, the current density 
will be decreased, and the stimulation will not be 
enough to excite the olfactory nerve. Some patients 
could be tested bilaterally. Before testing, the olfac- 
tory cleft was opened by treatment with gauze with 
0.1% epinephrine, and a veteran otolaryngologist, 
using a speculum, placed the stimulating electrode 
on the olfactory mucosa. The patients were slightly 
sedated by monosodium trichlorethyl phosphate syr- 
up (Tricloryl, Glaxo Wellcome, London, England, 
20 to 25 mL given orally); Tricloryl syrup is often 
used for sleep activation in clinical electroencepha- 
lography (EEG), because it induces a state similar to 
natural sleep.!> This slight sedation is effective in 
decreasing background EEG. All the methods of mea- 
suring EOEP were applied in accordance with our 
previous report.!2 


The definition of the EOEP is as follows: 1) the 
peak is observed 20 to 30 ms after stimulation; 2) 
the amplitude of the EOEP is the highest at the frontal 
scalp on the stimulated side; and 3) the polarity of 
the differential EOEP is reversed at the border be- 
tween the stimulated and nonstimulated sides of the 
frontal scalp. 


The patients’ olfaction was divided into 4 catego- 
ries: normosmia, hyposmia, anosmia, and parosmia, 


520 Ishimaru et al, Olfactory Evoked Potential 


R1 Rf REFLf Ll 





To an electric 
stimulator 


15 mm <— 


oe 100 mm ——> 
ae ta en 
se $65 mm — 


B 2.5 mm 





Fig 1. Location of recording electrodes (A) and schema 
of stimulating electrode (B). A) Four active electrodes 
were located in right lateral (R1), right frontal (Rf), left 
lateral (L1), and left frontal (Lf) areas of scalp. Reference 
electrode (REF) was located in upper frontal area of scalp. 
Earlobe on stimulated side served as ground (GND). B) 
Tip of stimulating electrode was made of silver sphere 
0.5 mm in diameter. 


depending on their complaints. 


Because the EOEP is a lateralized phenomenon, it 
seems that the relationship between EOEP detect- 
ability and the olfactory threshold level of lateralized 
T&T olfactometry is more significant than the rela- 
tionship between EOEP detectability and the olfac- 
tory threshold level of bilateral T&T olfactometry. 
However, olfactory ability is usually similar on both 
sides in olfactory sensory disturbance, except in cases 
caused by a brain tumor.!© Therefore, we used bilater- 
al T&T olfactometry in this preliminary report on 
clinical EOEP. 


RESULTS 
EOEPs AND T&T OLFACTOMETRY 


Detectable Cases. The EOEPs were detectable 
from 10 sides of 8 patients (see Table). We regarded 
the EOEP as detectable when right or left stimulation 
of the olfactory mucosa evoked an EOEP. The olfac- 
tion categories of these patients were normosmia, hy- 
posmia, and parosmia. A typical EOEP, from patient 
7, is shown in Fig 2A. An EOEP was recorded at 
high amplitude in the frontal part of the stimulated 
side (Fig 2A, Rf). No responses were recorded with- 
out electrical stimulation (Fig 2B). For the bilateral 
T&T olfactory threshold, the detection threshold was 
1.32 + 1.99 (mean + SD) and the cognitive threshold 
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Fig 2. Electrical olfactory evoked potentials (EOEPs) of 
clinical cases. Potentials were recorded from 4 active 
electrodes, which were located on right lateral (RI), right 
frontal (Rf), left frontal (Lf), and left lateral (L1) areas of 
scalp. Left column (A,C,E) indicates stimulated stage, 
and right column (B,D,F) indicates unstimulated stage. 
EOEP of Rf had highest amplitude in patient 7. In patient 
13, amplitude of EOEP of Lf was highest when right ol- 
factory cleft was stimulated. No evoked potentials were 
detected at unstimulated stage. Arrow indicates electrical 
stimulation, and negative peak of EOEP is indicated as 
asterisk. Gray record of Lf in patient 13 is another trial 
of same patient. Twin peaks were reproducible in current 
experiment. 


was 3.02 + 1.64. None of the patients in this category 
complained of lateralized olfactory dysfunction. 
There were twin peaks at 25 ms and 30 ms in patient 
13. We regarded the peak latency of patient 13 as 25 
ms. The peak latency of the EOEPs was 22.4 + 4.6 
ms (n = 10). During electrical stimulation, no sense 
of smell occurred, in concordance with our previous 
report.!? When odorants stimulate the olfactory mu- 
cosa, selected olfactory receptor cells are excited, 
but nonselected olfactory receptor cells are excited 
during electrical stimulation on the olfactory mucosa. 
We suspected that the excitation of olfactory receptor 
cells of all odorant receptor types was not cognitive 
and called it “white smell,” akin to the concept of 
white noise. ! 


Undetectable Cases. Four patients had T&T de- 
tection thresholds of 5.8 — in other words, off the 
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scale. No EOEPs were recorded with or without elec- 
trical stimulation. A typical case, that of patient 10, 
is presented (Fig 2C,D). There were 8 sides of 6 pa- 
tients in this category. In bilateral T&T olfactometry 
of 6 sides, the detection threshold was 4.67 + 2.03 
and the cognitive threshold was 5.80 + 0.00. 


Case With Twin EOEP Peaks. A case of EOEP with 
twin peaks, in patient 13, is shown in Fig 2E,F. These 
twin peaks were reproducible (n = 2) in the current 
experiment. The latencies of the twin peaks, are 25 
ms and 30 ms after stimulation, respectively. 


DISEASE AND EOEPs 


Olfactory Meningioma. Lateralized T&T olfac- 
tometry was performed in a case of meningioma (pa- 
tient 7). The right and left EOEPs were recorded sep- 
arately. The difference between the right and left la- 
tencies was 2 ms. However, the T&T olfactogram 
showed left-sided hyposmia (see Table). 


Common Cold. Patient 4 had post-common cold 
hyposmia. The patient was in the process of recover- 
ing when the EOEP was recorded. The peak latency 
was 24 ms after stimulation. On T&T olfactometry, 
the detection threshold was within the normal range, 
but the cognitive threshold was higher than the nor- 
mal threshold. 


COMPLAINTS AND EOEPs 


Two patients with no olfactory complaints for 
whom investigation of olfaction was ordered by an- 
other department showed well-evoked EOEPs. In 5 
patients who complained of hyposmia, EOEPs were 
detectable in 4 patients and not detectable in 1 patient. 
No EOEPs were detectable in any of the 5 patients 
who complained of anosmia. In 2 patients who com- 
plained of parosmia, EOEPs were detectable. 


DISCUSSION 


The mean latency of the first peak in the EOEPs 
of 5 subjects was reported to be 19.4 ms after stimu- 
lation.!2 The mean latency (+SD) of the present study 
was 22.4 + 4.6 ms (n = 10). The difference in the 
latencies is thought to be a result of the difference in 
methods; in the former study, the latency of grand 
averaged EOEP is reported, and the latter study re- 
ports the average of EOEP latencies. 


The amplitude of background EEGs of the EOEP- 
undetectable cases was lower than that of the EOEP- 
detectable cases. It is well known that the olfactory 
bulb submits the induced wave during smelling, but 
the background EEG of the olfactory bulb is also 
known.!? We often found that the amplitude of back- 
ground EEGs was augmented after electrical stimu- 
lation of the olfactory mucosa (data not shown). A 
lower amplitude of the background EEGs themselves 


may indicate a less-activated olfactory bulb. 


The difference in the detection thresholds of T&T 
olfactometry between EOEP-detectable cases and 
EOEP-undetectable cases is discussed. Eighteen sides 
of the nose were examined. The lateralized T&T ol- 
factory threshold of patient 7 was regarded as the 
threshold of the right side, because the bilateral olfac- 
tory threshold is represented by the superior side. 
Ten sides were EOEP-detected and 8 sides were 
EOEP-undetected. The difference in the T&T detec- 
tion thresholds between EOEP-detected and -unde- 
tected sides was significant (Student’s f-test, p < 
.005). The difference in the T&T cognitive thresholds 
between EOEP-detected and -undetected sides was 
also significant (Student’s t-test, p < .005). 


Although the T&T detection threshold was within 
the normal range for patient 9, the EOEP was not 
detected. The cognitive threshold was off the scale 
in this case. Because EOEP is thought of as an olfac- 
tory bulb—level response, hyposmia may be caused 
by bulbar or olfactory epithelial disturbance. The de- 
tection of T&T odorants was possibly trigeminal. But 
this patient’s condition was also combined with hypo- 
geusia, which may indicate that he also had central 
olfactory disturbance at the upper olfactory bulbar 
level. In patient 14, the detection and cognitive T&T 
olfactory thresholds were 4.8 and 5.8, respectively. 
Although the cognitive threshold of 5.8 is off the 
scale, the EOEP was detectable. Therefore, the focus 
of olfactory dysfunction was thought to be at the up- 
per bulbar level. 


In this study, the diagnostic value of the EOEP is 
statistically proven, but one must carefully interpret 
the results of EOEP detection or nondetection in the 
case of central olfactory disturbances, because the 
EOEP is the potential change within the olfactory 
bulb, and upper olfactory bulbar problems may not 
be reflected. 


Patient 13 experienced EOEPs with twin peaks. 
Similar EOFPs were reported in rabbits. Yamamoto!® 
reported, regarding twin peaks, that the second peak 
is obscured at a very high strength of stimulus and is 
suppressed by a single shock to the anterior commis- 
sure in rabbits. Yamamoto!8 also reported that an in- 
travenous injection of pentobarbital sodium sup- 
presses the second peak in rabbits. Although patient 
13 had pituitary adenoma, we have no proof of a re- 
lationship between intracranial disease and the sec- 
ond peak. We have no evidence that Triclory] influ- 
ences these peaks. Therefore, it is difficult to con- 
clude why twin peaks were observed in this study. 

As electrophysiological olfactometry, EOEP can- 
not diagnose the degree of olfactory disturbance. The 
amplitude of the EOEP is not affected by the olfactory 
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threshold. It is thought that the amplitude is signifi- 
cantly affected by the electroconductance of the 
scalp. When the nasal frontal sinus is located between 
the electrode and the olfactory bulb, which is thought 
to be the current sink of the EOEP, the amplitude of 
the EOEP may decrease. The degree of development 
of the nasal frontal sinus is thought to affect the am- 
plitude of the EOEP. 


The stimulation conditions are 2 mA and 0.5 ms. 
A stimulation current of less than 2 mA is too weak 
to evoke the EOEP, but more than 2 mA induces pain. 
It seems that a strength of 2 mA is adequate. It is 
generally said that electrophysiological measurement 
of the sensory system is better for obtaining evoked 
potentials that originate from the lower sensory cen- 
ters. In the history of electrophysiological audiom- 
etry, the slow vertex response (SVR) was used in the 
preliminary period, but now the ABR is in general 
use for electrophysiological audiometry, rather than 
the SVR. The reason is that the ABR is not affected 
by the consciousness of the subject. Recently, the elec- 


trical ABR (EABR) has been used for cochlear im- 
plant fitting.!? The EABR of cochlear implant users 
provides information about the auditory system as 
well as does the ABR of subjects who do not use co- 
chlear implants. Therefore, the EOEP is expected to 
be useful for diagnosing the olfactory system, as is 
the EABR for the auditory system. 


Previous articles on electrophysiological olfactom- 
etry often used the OEP originating from the cortical 
brain. There is a similarity between the OEP in olfac- 
tometry and the SVR in audiometry. Reported OEPs 
were produced by odorous air pulses. The difference 
between OEPs and EOEPs is not only the stimulation 
method, but also the latency of the peak of the evoked 
potentials after stimulation. Although electrical stim- 
ulation is less physiological than odorous stimulation, 
EOEP has the merit of being unaffected by conscious- 
ness, because it originates from the lower olfactory 
center — possibly the olfactory bulb. Therefore, the 
EOEP is a useful method for diagnosing olfactory 
disturbance, akin to the ABR in audiometry. 
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RESULTS OF FASCIAL AUGMENTATION IN UNILATERAL VOCAL 
FOLD PARALYSIS 
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The objective of this study was to evaluate the effects on voice quality of augmentation by injection of minced fascia in patients 
with unilateral vocal fold paralysis. Preoperative and postoperative voice samples from 14 patients (6 men and 8 women; mean age, 
59 years) were analyzed by computerized acoustic analysis and blinded perceptual evaluation. Statistically significant improvements 
were seen in perturbation measurements (jitter and shimmer), noise-to-harmonics ratio, and maximum phonation time. A panel of 
evaluators rated 10 of the 14 postoperative voices as normal or near-normal. Injection laryngoplasty with minced fascia offers a new, 
effective, well-tolerated, and inexpensive method to medialize a paralyzed vocal fold. The graft seems to survive well, as indicated 


by good vocal results with a follow-up ranging from 5 to 32 months. 


KEY WORDS — fascia, injection laryngoplasty, vocal fold paralysis, voice quality, voice surgery. 


INTRODUCTION 


Currently, 3 main methods exist for surgical cor- 
rection of unilateral vocal fold paralysis (UVFP): in- 
jection medialization,!? laryngeal framework sur- 
gery,** and reinnervation procedures.>.6 The first 2 
have gained in popularity. The vocal fold injection 
procedure is short and simple to perform. However, 
because of the adverse effects of foreign substances, 
Teflon’ and silicone® are less than optimal for vocal 
fold augmentation. 


The search for a more physiological material has 
led to introduction of purified bovine collagen? and 
autologous collagen extracted from skin.!9 A large 
amount of collagen (1.5 mL) is injected into Reinke’s 
space.!° It is subjected to considerable absorption and 
is potentially sensitizing.!! 


Hill et al!? and Zaretsky et al!3 have experimented 
with injected autologous fat in a canine model. Fat 
seems to be well tolerated by the vocal fold tissue, 
but absorption rates are high, resulting in a loss of 
volume after 3 months.}3 In humans, Mikaelian et 
al!4 and Brandenburg et al! have published good 
preliminary results of autologous fat injection for me- 
dialization of the paralyzed vocal fold. Shaw et al!6 
showed also, in their careful analysis of 11 UVFP 
patients with a 1-year follow-up, that long-term pho- 
natory advantage can be expected from the autolo- 
gous fat technique. On the other hand, Shindo et al!7 
stated that the results after 3 months are unpredict- 





able. In their study of 21 patients, long-term (6 to 12 
months) results were available in 11 patients. They 
considered autologous fat ideal for temporary medial- 
ization of a paralyzed vocal fold in patients in whom 
return of function is expected. 


An autologous substance with a low absorption 
rate would be ideal for vocal fold medialization. Free 
fascial grafts have been used for a century in head 
and neck, aesthetic, and reconstructive surgery.!8 The 
most common clinical examples include tympano- 
plasty, augmentation of the nasal dorsum and the na- 
sal tip, and the filling of depressed scars.!9 Accord- 
ing to Peer,” fascia is a stable and predictable graft 
material because of its simple basic structure com- 
bined with a low metabolic rate. Complications of 
fascial transplantation are uncommon, and fascial 
grafts are remarkably resistant to local infection.?! 
Factors that limit their utility are the thickness of the 
required transplant and the vascularity of the recipi- 
ent site.!9 Even though fascia is easily harvested from 
multiple sites of the body, the lack of 3-dimensional 
bulk makes it well suited only for small-volume 3- 
dimensional defects.!9 With these limitations in mind, 
the vocal fold seemed like an ideal recipient site for 
fascia. A new technique was introduced in 1998 by 
Rihkanen,”2 in which the graft is excised by open 
exposure from the fascia lata, chopped into paste by 
a scalpel, and injected into the vocal fold in the same 
manner as fat and Teflon. 


Evaluation of the results of phonosurgery requires 
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subjective and objective measures. No simple asso- 
ciation exists among voice production, subjective au- 
ditory perception, and objective acoustic data. In as- 
sessing patients after a surgical procedure, it is an 
advantage to have preoperative values as a reference 
to the postoperative measurements. Perceptual evalu- 
ation by a panel of voice professionals is still valued 
by many clinicians, despite the difficulty of standard- 
izing the task. Several recent reports of vocal fold 
medialization apply perceptual evaluation, !5:!6.23.24 
since the ultimate goal of surgery is a voice that 
sounds clear and healthy. The Japanese GRBAS sys- 
tem is probably the most widely used scale because 
of its relative simplicity. Even this scale carries the 
problem of intrarater and interrater reliability,2> but 
the general “grade” value, as well as “roughness” 
and “breathiness” of voice, is evaluated by profes- 
sionals with moderate reliability and interrater con- 
sistency.?° 


The acoustic analysis of the voice signal is an at- 
tractive method for assessing pathological voice, be- 
cause it is easy to obtain, noninvasive, and provides 
objective data. Unilateral vocal fold paralysis results 
in a combination of insufficient glottal closure and 
asymmetry in vocal fold mass. The perturbation mea- 
sures of voice acoustics reflect these physical facts 
in 2 different but interrelated ways. Irregular glottal 
pulses are heard as roughness of voice and are re- 
flected by jitter,” a cycle-to-cycle temporal varia- 
tion. Shimmer, a cycle-to-cycle amplitude variation, 
increases with poor and inconsistent contact between 
the vocal fold edges.?8 In addition to period-to-pe- 
riod variations, inconsistent or absent vocal fold clo- 
sure leads to air leakage through the glottis. It is 
acoustically characterized by high-frequency noise. 
The noise-to-harmonics ratio (NHR) gives an estima- 
tion of the proportion of aperiodic signal to periodic 
signal produced by the vocal folds. The NHR seems 
to integrate both principal components of hoarseness: 
breathiness and roughness.?* The reliability of the 
commercially available acoustic analysis software 
programs has recently attracted some criticism. Per- 
turbation values, in particular, agree poorly between 
programs.”? However, with a standard setting of 
equipment, a trend should be discovered as to wheth- 
er the outcome of surgery has a constant impact on 
these values. 


Maximum phonation time (MPT) is commonly 
used430.31 to measure changes in laryngeal valving 
efficiency, despite the many shortcomings the value 
is subject to. Age, sex, size, pulmonary function, gen- 
eral health condition, cooperation, and instructions 
have an influence on the results, so that MPT is not 
very suitable for comparing one group of patients to 


another. However, it is feasible to use when follow- 
up data from the same patients are to be analyzed.?2 


The purpose of this study was to evaluate the ef- 
fects of injection augmentation with minced fascia 
on the voice quality of patients with UVFP. Blinded 
perceptual rating, computerized acoustic analysis, 
and measurement of MPT served as assessment cri- 
teria. Because of the preliminary study, we expected 
the values to shift toward a more normal voice. 


PATIENTS AND METHODS 


From the fall of 1994 to the spring of 1997, pa- 
tients scheduled for surgery for UVFP at Helsinki 
University Central Hospital were invited to partici- 
pate in this prospective experimental study with the 
approval of the ethics committee of the hospital. Each 
patient underwent a preoperative examination by a 
phoniatrician and an otorhinolaryngologist, includ- 
ing patient history, a clinical examination, and video- 
stroboscopy. Each patient had completed at least 10 
sessions of voice therapy (one 45-minute session per 
week from a speech-language pathologist not in- 
volved in this study) before the surgical treatment 
was suggested. The indications for surgery were poor 
voice quality and inadequate loudness as judged by 
the patient and the phoniatrician, combined with an 
open glottal gap during phonation on videostrobos- 


copy. 


A total of 23 patients underwent fascial augmenta- 
tion. In order to find out the impact of the surgical 
procedure on voice or to be able to carry out a blinded 
perceptual evaluation, we excluded patients from the 
study for the following reasons: age greater than 80 
years (N = 1), age less than 16 years (N = 1), previ- 
ous thyroplasty type I (N = 1), or lesions of several 
cranial nerves (N = 1). Two patients were lost to fol- 
low-up, and 2 died of ongoing disease (lung cancer). 
In addition, 1 patient needed arytenoid adduction soon 
after fascial augmentation, and his results were ex- 
cluded. Hence, 9 patients were excluded, and 14 were 
included in this evaluation: 6 men and 8 women from 
42 to 73 years of age (mean age, 59 years). The cause 
of the paralysis was iatrogenic in 10 of the 14 pa- 
tients. The causes of vocal fold paralysis and the du- 
ration of paralysis before medialization are shown 
in the Table. The patients had routine follow-up ex- 
aminations 1 and 6 months after operation. For this 
study, however, all postoperative data were measured 
at a collective supplementary control visit, which re- 
sulted in a different follow-up time (range, 5 to 32 
months) for each individual (see Table). 


Surgical Technique. Under local anesthesia, the 
fascia lata was exposed through a 4-cm longitudinal 
skin incision on the lateral aspect of the thigh. A 3 x 
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DEMOGRAPHIC DATA FOR PATIENTS 








Duration 
Age Cause of UVFP Follow-up 
Patient Sex (y) of UVFP (mo) (mo) 
1 M 73 Vagal neurinoma 27 32 
2 F 64 Struma nodosa* 12 30 
3 F 50 Struma nodosa* 6 18 
4 F 59 Struma nodosa* 9 13 
5 M 65 Struma nodosa* 9 12 
6 M 42 Parathyroid gland 5 12 
adenoma 
7 M 50 External trauma 32 12 
8 F 65 Idiopathic 10 10 
9 F 58 Struma nodosa* 6 8 
10 F 57 Struma nodosa* 16 7 
11 M 64 Papillary thyroid 18 7 
carcinoma 
12 M 6l Coronary 14 6 
arteriosclerosis 
13 F 58 Struma nodosa* 24 6 
14 F 57 Struma nodosa* 6 5 
Mean 59 14 13 


UVFP — unilateral vocal fold paralysis. 
*Vocal fold paralysis caused by surgical intervention. 





2-cm area of fascia was excised with some small 
pieces of subcutaneous fat. The fascia was minced 
into tiny 0.5-mm pieces with a No. 10 blade. The 
paste thus formed (approximately 0.3 to 0.6 mL) was 
placed in a 1-mL syringe. The paste was followed 
by approximately 0.3 mL of fat to act as a vehicle 
for emptying the long needle. Laryngomicroscopy 
was performed under general anesthesia with jet ven- 
tilation. One injection lateral to the vocal process and 
2 injections directed further anterior deep inside the 
fold were carried out with a Karl Storz (Tuttlingen, 
Germany) injection set with an 18-cm-long needle 
(outer diameter, 1.3 mm). The patients were dis- 
charged from the hospital on the same day or on the 
first postoperative day. 


The preoperative and postoperative voice samples 
were recorded in a soundproof booth. A Bruel & 
Kjaer microphone (model 4176) with a mouth-to- 
microphone distance of 30 cm and a Casio DA-7 digi- 
tal audiotape recorder were used. For the acoustic 
analysis and measurement of MPT, each patient had 
to sustain the vowel /a/ at a comfortable pitch and 
loudness level in 2 consecutive recorded trials. For 
the analysis of voice acoustics, we used Kay Elemet- 
rics Computerized Speech Lab (model 4300B, soft- 
ware version 5.X), a personal computer, and the 
Multi-Dimensional Voice Program (MDVP model 
4305). The sampling rate was adjusted to 50,000 Hz, 
and anti-aliasing was used. To diminish the effects 
of initiation, 1 second was erased from the begin- 
ning of the recorded sample and the following 2 sec- 


onds were analyzed. In voice samples lasting less than 
4 seconds, 25% was cut from the beginning and from 
the end. The mean length of the samples analyzed 
was 1.98 seconds. 


Each patient had had several rehearsals of the MPT 
test during voice training. Out of the 2 consecutive 
recordings described above, the longer-lasting per- 
formance was used for the evaluation. 


For the perceptual voice assessment, a standard 
text (an easy-to-read story in Finnish consisting of 
20 sentences) was recorded. Later, a tape was digi- 
tally constructed in which the first 3 sentences (mean 
duration, 20 seconds) of preoperative and postopera- 
tive samples were arranged in random order at 15- 
second intervals. For measurement of panel intrarater 
reliability, the preoperative sample of | patient and 
the postoperative sample of another patient appeared 
3 times randomly on the tape. The panel members 
— 2 phoniatricians and 4 speech-language patholo- 
gists — each had a minimum of 10 years’ experi- 
ence in clinical voice pathology. They evaluated the 
samples in a blinded manner, using a visual analog 
scale (a line from 0 to 10 cm). There were 3 lines for 
each voice sample, representing perceived general 
grades of voice, roughness, and breathiness. The bet- 
ter the quality, the higher the value (mark on the right 
end of the line). For orientation, the panel heard 2 
voice samples (anchor stimuli) of UVFP patients not 
included in this study representing an aphonic voice 
and an average normal speaking voice. 


To test the statistical significance, we used a non- 
parametric test (Wilcoxon signed rank test; SPSS per- 
sonal computer software program, version 7.5) for 
all parameters, because this series consisted of a small 
number of subjects (N = 14), and the distribution of 
results of acoustic analysis was wide and scattered 
unevenly. We considered p values of less than .01 to 
be statistically significant. The intrarater reliability 
of the panel and the correlations between objective 
and subjective measurements were assessed by Pear- 
son correlation coefficient (SPSS). 


RESULTS 


No immediate postoperative complications related 
to the injection procedure occurred, and all patients 
were discharged on the day of surgery or on the fol- 
lowing day. No antibiotics or corticosteroids were 
given. At the donor site, 2 prolapses of the vastus 
lateralis muscle occurred, of which 1 was surgically 
corrected. 


Perceptual Evaluation. Perceptual evaluation was 
carried out by a panel of 6 professionals. The means 
of 6 ratings for each voice sample were used for the 


526 Reijonen et al, Fascial Augmeniation 


Grade Roughness 


Breathiness 


























= eer: 























| 
i 
| 


ae ee eo 





p<.001 





8 9 10 11 12 13 14 M 









AAAA AAA 


"ewww w we 








p<.05 p < .001 


Fig 1. Perceptual evaluation of voice for patients 1 through 14, and mean value (M). Change from preoperative value (un- 
shaded) to postoperative value (shaded) is statistically significant (Wilcoxon signed rank). On visual analog scale, 0 represents 


extremely abnormal, and 10 represents normal. 


analysis. Grade and breathiness values improved in 
all patients, and the respective mean values were 
130% and 138% better after the procedure. Rough- 
ness was rated as slightly increased in 2 patients af- 
ter operation (patients 1 and 2), but the mean value 
improved by 54%. The panel rated 10 of the 14 post- 
operative voices (71%) as normal or near-normal 
(scale value of grade > 7.5). The improvement was 
statistically significant for grade (p < .001) and 
breathiness (p < .001, Wilcoxon signed rank test; Fig 
1). The reliability of the listening panel was tested 
by playing 2 voice samples (1 preoperative sample 
and | postoperative sample) 3 times. Pearson corre- 
lation quotients measuring intrarater agreement were 
high: r= 0.93 tor = 0.99. 

Aerodynamic Measures. Figure 2 illustrates the pre- 
operative and postoperative MPT values. The mean 
values improved from 5.8 to 11.4 seconds, but the 
results were scattered over a wide range. The perfor- 
mance was improved in 12 of the 14 patients, and 
the improvement was more than 25% in 9 of the 14 
patients. The increase was statistically significant (p 
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Fig 2. Maximum phonation time before (unshaded) and 
after (shaded) surgery for patients | through 14, and mean 
value (M). Change is statistically significant (p < .01, 
Wilcoxon signed rank). 


< .01, Wilcoxon signed rank test). Patient 7 had a 
clearly shorter performance after operation. 


Acoustic Analysis. Acoustic data were analyzed 
from 12 samples. The computer program was un- 
able to calculate the acoustic measurements of pa- 
tient 12, because both the preoperative and the post- 
operative samples were insufficiently voiced. The 
preoperative data from patient | could not be assessed 
because of a technical error in the original record- 
ing. In addition, in 6 patients, the original voice sam- 
ple captured from the digital audiotape had to be am- 
plified for MDVP in order to obtain measurable sig- 
nals (patients 3, 4, 8, 9, 13, and 14). Of the 12 pa- 
tients analyzed, there was a statistically significant 
decrease from preoperative to postoperative values 
for all 3 parameters: absolute jitter (p < .01), shim- 
mer percent (p < . 01), and NHR (p<. 01, Wilcoxon 
signed rank test; Fig 3). 


In order to learn how the objective and subjective 
parameters correlate, we calculated the numerical 
change for each parameter and ran a correlation ma- 
trix of these figures. The improvement in perceptual 
grade values correlated to diminished shimmer (r = 
.69), but the association was not statistically signifi- 
cant (p < .05, Pearson). The correlation between im- 
proved perceptual grade and reduction in jitter and 
NHR was not very high. The decrease in breathiness 
failed to correlate to any acoustic parameter. De- 
creased roughness had a correlation to reduced shim- 
mer (r = .3), although it did not show statistical sig- 
nificance (p < .05, Pearson). 


DISCUSSION 


Vocal fold augmentation by injection laryngoplasty 
has undergone few changes in the past few decades. 
The procedure is considered relatively safe and fairly 
easy to master, even for surgeons who perform it sel- 
dom. However, synthetic substances, especially Tef- 
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Fig 3. Acoustic measures for 12 patients. Mean values are expressed by dashed lines. Decrease is statistically significant (p < 
.01, Wilcoxon signed rank). Normative values are jitter of less than 83.2 us, shimmer of less than 3.81%, and noise-to- 
harmonics ratio of less than 0.19 (Multi-Dimensional Voice Program, Kay Elemetrics). 


lon, have created several problems.’ During the past 
10 years, the research in reconstructive and aesthetic 
surgery has focused on autologous graft materials. 
Trials with autologous fat in various head and neck 
sites have proven that the fat graft is well accepted 
by the recipient tissue with only mild inflammation 
and no foreign body reaction.!9 Yet, the long-term 
outcome of fat grafts in humans remains somewhat 
controversial. Significant resorption has been reported 
with free fat grafts in vocal folds, as well as in other 
sites of the body.!4-!7-19 The same holds true for au- 
tologous or bovine collagen, and considerable over- 
correction is required at the time of primary injec- 
tion. The search for a more stable autologous ma- 
terial has led some to consider fascia as an alterna- 
tive graft material. In their thorough article, Boyce 
et al!’ review the experience with transplanted fas- 
cia in head and neck sites. Fascia has been used most- 
ly as a single-layer graft, since it is a good substitute 
for thin, delicate, and pliable tissue (tympanoplasty). 


‘Several reconstructive procedures take advantage of 


its high tensile strength and the fact that it can be 
secured in watertight closurés (replacement for dura 
mater). The fate of these single-layer grafts has been 
studied in numerous animal models.!? After a 3-week 
period of mild inflammation and edema, the graft 
undergoes few changes. The same applies to single- 
layer grafts in humans. Fascia is also well-suited to 
act as a “filler” of small-volume 3-dimensional de- 
fects in augmentation of the nasal dorsum,?? the na- 
sal tip,”! thin lips,*4 depressed scars, etc. In such cases, 
the fascia is often rolled, bunched, or layered, and 
some histologic changes are likely to occur: the num- 
ber of viable fibroblasts decreases and, histologically, 
the graft appears as hypocellular fibrous tissue. De- 
spite these changes, the bulk of the graft is main- 
tained.35 


The structural integrity of the fascial graft, even 


when processed, suggests good long-term reliabil- 
ity. So far, augmentations with fascia have often been 
conducted by placing the grafts into relatively static 
areas of the head and neck.?!,33.35 The vocal fold is a 
dynamic recipient site, and the graft is subjected to 
constant pressure changes during phonation, cough- 
ing, and swallowing. In addition, in order to trans- 
plant fascia by injection, the graft has to be minced 
into tiny pieces. This mincing exposes the graft to 
resorptive processes not mentioned in previous re- 
ports. 


A good vocal result requires not only the stability 
of the implanted material, but also the maintenance 
of elasticity of the vocal fold and free mobility of 
the mucosal lining. Both may be hampered by scar 
tissue and granuloma formation. According to the 
previous experience with fascia in other recipient 
sites, chronic inflammation or a change in the sub- 
epithelial layer is unlikely to happen in vocal folds. 


The results of the present study show statistically 
significant improvements in postoperative voice qual- 
ity. The blinded listening panel rated 10 of the 14 
postoperative voice samples as normal or near-nor- 
mal (grade value of >7.5 on the visual analog scale 
from 0 to 10). In general, it seemed easy for the panel . 
to agree on a “good” voice; the ratings on the right 
end of the visual analog scale were less scattered than 
the ratings of poor voice quality on the left end of 
the scale. This finding is in agreement with those of 
earlier studies.25 The improved quality of voice was 
expressed in all 3 measured parameters: grade, breath- 
iness, and roughness. The improvement in percep- 
tual grade values showed a correlation to reduction 
in shimmer and roughness. However, these correla- 
tions failed to reach statistical significance. Breath- 
iness values showed a weaker relationship to voice 
acoustics. 


The MPT values improved more than 25% in 9 of 
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the 14 patients. Measuring MPT carries a risk of sev- 
eral errors, some of which can be avoided by com- 
paring the preoperative result to the postoperative 
value. It is interesting that the mean values (5.8 sec- 
onds before operation and 11.4 seconds after opera- 
tion) of the present study are almost identical to the 
figures in earlier studies by Lu et al? (6.2 and 11.2 
seconds, respectively) and Adams et al36 (5.6 and 
10.9 seconds, respectively) in which thyroplasty type 
I was evaluated. 


The voice acoustics also showed a significant im- 
provement, as measured by diminished perturbation 
(absolute jitter and shimmer percent) and improved 
NHR. The fact that some of the recorded samples 
had to be amplified for the final acoustic evaluation 
carries a risk of bias on the amplitude values (shim- 
mer and harmonics). Thus, comparison of absolute 
amplitudes was not applied. We chose to present the 
parameters in which amplitude has the least effect 
on results: proportional shimmer (shimmer percent) 
and NHR. Jitter stands for temporal perturbation and 
is not influenced by the signal amplitude. Compared 
to the normative values in the MDVP manual (mea- 
sured in healthy American individuals), the postop- 
erative absolute jitter and NHR values fell within nor- 
mal limits for the majority (66%) of our patients. In 
contrast, the postoperative shimmer values failed to 
reach normative values. 


In addition to the patient who underwent an addi- 
tional arytenoid adduction soon after the injection 
procedure and was not included in the study group, 
3 of the 14 patients (Nos. 4, 7, and 12) ended with 
poor outcomes and can be considered failures. Their 
voice quality was considered moderately or extreme- 
ly pathological after the operation, although some im- 
provement did occur (Fig 1). On close inspection, it 
seems likely that the causes of failure were not re- 
lated to resorption. At the first postoperative visit, 1 
month after surgery, patient 4 still had a consider- 
able glottal gap on phonation during videostrobos- 
copy, indicating insufficient augmentation. A crush 
injury to the larynx with a long-term tracheotomy had 
altered the anatomic structures of patient 7, and an 


attempt to enhance his glottal closure was not suc- 
cessful. Coronary artery surgery coupled with postop- 
erative recurrent laryngeal nerve paralysis resulted 
in a long-lasting mental depression in patient 12, and 
the voice was poor at the postoperative study visit. 


According to the present study, the vocal results 
of fascial augmentation appear to be stable: the panel 
rated the voice samples of the patients (Nos. 1-3) with 
the longest follow-up time (range, 18 to 32 months) 
as normal (Fig 1). The concept of “long-term” re- 
sults with autologous grafts is controversial. In ani- 
mal experiments, free fascial grafts undergo few his- 
tologic changes after the first few weeks.*> Hence, it 
would be unlikely to see major resorption after 6 
months. Therefore, our findings could be interpreted 
to represent the final result. 


CONCLUSIONS 


Injection of autologous fascia offers a reliable 
choice for treatment of patients who require augmen- 
tation of the vocal fold. The procedure is short, easy 
for the surgeon to master, and can be performed on 
an outpatient basis. In the present material, no im- 
mediate postoperative complications occurred. The 
method is efficient in improving voice quality and 
offers stable results, as confirmed here by percep- 
tual evaluation and objective acoustic voice analy- 
sis. Slight overcorrection of the vocal fold might fur- 
ther improve results and, according to our experi- 
ence (unpublished), is recommended. Four patients 
out of 23 with UVFP could be considered failures. 


The purpose of the present study was not to com- 
pare the results of fascial augmentation with those 
of other medialization techniques. This would, how- 
ever, prove interesting, because fairly large glottal 
gaps can be successfully closed by injections of autol- 
ogous fascia. At our institutions, every patient sched- 
uled for vocal fold medialization because of UVFP 
is a candidate for fascia injection deep into the thyro- 
arytenoid muscle. When there are no contraindica- 
tions to short general anesthesia, the procedure is per- 
formed, and arytenoid adduction will be carried out 
later in patients with poor vocal fold tension. 
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FUNCTIONAL ANATOMY OF LEVATOR VELI PALATINI MUSCLE AND 
TENSOR VELI PALATINI MUSCLE IN ASSOCIATION WITH 
EUSTACHIAN TUBE CARTILAGE 
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The anatomic relationships among the levator veli palatini muscle (LVFM), the tensor veli palatini muscle (TVPM), and the 
eustachian tube (ET) cartilage were investigated by computer-aided 3-dimensional reconstruction and measurement methods. The 
study used 13 normal temporal bone-ET specimens obtained from 13 individuals (range of age at death, 3 months to 88 years). This 
study revealed several anatomic features of the anterior cartilaginous portion of the ET. First, the LVPM is always located inferolateral 
to the inferior margin of the medial lamina (ML) of the ET cartilage. Second, the LVPM has a large cross-sectional area throughout 
the extent of the anterior cartilaginous portion of the ET. Third, although the LVPM lies close to the ML of the ET cartilage (0.44 + 
0.16 mm in children and 1.02 + 0.58 mm in adults), there is no region of attachment. Finally, the TVPM is not attached to the lateral 
lamina (LL) of the ET cartilage of the anterior quarter of the cartilaginous pcrtion. Accordingly, it could be assumed that the most 
anterior cartilaginous portion of the ET is opened primarily by the contraction of the LVPM, which causes a superior-medial rotation 
of the ML. Furthermore, since the contraction time of the LVPM is reported to be longer than that of the TVPM, the anterior 
cartilaginous portions of the ET may remain open, even after the middle to posterior cartilaginous portions are closed after relaxation 
of the TVPM. This process would produce a pumping action of the ET in the direction from the middle ear to the pharyngeal side. 
The pumping function may be beneficial to clearance of the middle ear. 


KEY WORDS — computer-aided three-dimensional reconstruction, eustachian tube, levator veli palatini muscle, tensor veli 











palatini muscle, ventilatory and clearance functions of eustachian tube. 


INTRODUCTION 


Several studies have investigated the role of pala- 
tal muscles for opening the eustachian tube (ET).! 
Although most studies defined the tensor veli palatini 
muscle (TVPM) as a dilator of the ET, there has been 
a diversity of opinion concerning the role that the leva- 
tor veli palatini muscle (LVPM) plays in ET opening. 
The results of one electrophysiological experiment? 
in dogs indicated that the LVPM is not active during 
opening of the ET in that species. However, the LVPM 
is so close to the ET lumen and ET cartilage that one 
cannot help wondering what kind of influence it has 
on ET function. This study used 3-dimensional (3- 
D) reconstructions of human temporal bone-ET sec- 
tions to analyze anatomic relationships among the 
LVPM, the TVPM, and the ET cartilage that are rele- 
vant to the role of these muscles in opening the ET 
lumen. 


METHOD 
The materials used for this study were 13 tempo- 


tal bone-ET specimens obtained from 13 -ndividu- 
als whose ages were 3 months to 88 years at death. 


None of the subjects had any known congenital anom- 
aly or clinical history or histologic evidence of ear 
disease. We separated them into 2 groups: children 
(6 cases; age range, 3 months to 10 years; mean age, 
4.2 years) and adults (7 cases; age range, 17 to 88 
years; mean age, 54.6 years). 


The specimens were fixed in 10% formalin, decal- 
cified in 5% trichloroacetic acid, dehydrated in graded 
concentrations of ethanol, and embedded in celloi- 
din.’ After reference marks were made in the celloi- 
din blocks for 3-D reconstruction, the blocks were 
sectioned serially at 30-um thickness as close to per- 
pendicular to the axis of the ET as possible. Every 
40th section was stained with hematoxylin and eosin 
and prepared for light microscopic study. Additional 
sections were stained for the part of the ET between 
the anterior margin of the junctional portion® and the 
posterior margin of the bony portion to determine the 
exact location of the junctional and bony portions 
and the cartilaginous portion along the entire ET. 
These histology slides were then used for computer- 
aided 3-D reconstruction. The equipment used for re- 
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Midpoint of tympanic orifice 





Fig 1. Schematic drawing of left eustachian tube (ET) 
viewed from anterosuperolateral direction. Line A-B is 
long axis of “virtual cut” cross section of ET lumen at its 
pharyngeal and tympanic orifices. Cross section is per- 
pendicular to longitudinal axis of ET. Line C-D is line 
perpendicular to and bisecting line A-B. Midpoints (X) 
of line C-D are seen as centers in each orifice of ET lu- 
men. 


construction consisted of a personal computer (NEC 
PC—9801RL), a frame buffer (Sapience Hyper- 
Frame+) for full-color images, a digitizer (Hitachi 
HDG-1515; resolution, 0.1 mm), and a printer. With 
the software developed by Takagi and Sando,? modi- 
fied by Fujita and Sando,!° and revised by Sudo et 
al,8 acccurate reconstruction and measurements could 
be made of the ET and its surrounding structures in 
3 dimensions. 


superior 


medial 





inferior 


Fig 2. Schematic drawing of “virtual cut” cross section 
of midcartilaginous portion of left ET. View is from pha- 
ryngeal side to tympanic side. This drawing shows how 
to determine midpoint of each cross section of ET. C — 
eustachian tube cartilage. 
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Fig 3. Method of defining standard point for inferior mar- 
gin of medial lamina (ML) of ET cartilage and center of 
gravity of levator veli palatini muscle (LVPM). Also 
shown are directions (vertical and horizontal) that we 
defined for convenience’s sake. LL — lateral lamina of 
ET cartilage. 


The ET lumen, the ET cartilage, the palatal mus- 
cles, and the temporal bones bordering the ET were 
reconstructed for each specimen. The initial step in 
the analysis was identification of a reference line 
connecting the centers of the pharyngeal and tym- 
panic orifices of the ET (Fig 1). The pharyngeal ori- 
fice was defined as the anteriormost section contain- 
ing a complete cross section of the mucosal lining 
of the ET; the tympanic orifice was defined as the 
posteriormost section in which the tympanic annu- 
lus was not present. The center of each orifice was 
defined as the midpoint of a line that is bisecting 
and perpendicular to the line connecting the supe- 
rior and inferior margins of the lumen. The former 
line was termed the medial-lateral direction, and the 
latter line was termed the superior-inferior direction 
(Fig 2). This pharyngeal orifice-tympanic orifice line 
provides a standard axis for creating a series of ap- 
proximately 200 computer-generated virtual cross 
sections (“virtual cuts”). These virtual cuts were all 
made perpendicular to the standard axis through the 
reconstructed images of the ET and its surrounding 
structures (cartilage, palatal muscles, and bone). In 
the virtual cuts, we again defined the pharyngeal 
orifice and the tympanic orifice in the same manner 
mentioned above. 


We reconstructed the 3 portions of the ET de- 
scribed by Sudo et alë: the cartilaginous portion, the 
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Fig 4. Schematic drawings of ET and ET cartilage at site 
at which type A and B relationships of LVPM to ML of 
ET cartilage are shown. Filled dot — standard point of 
ET cartilage; empty dot — center of gravity of LVPM; 
L— ET lumen. 


bony portion, and the junctional portion (located be- 
tween the bony and cartilaginous portions and sur- 
rounded by cartilage in the superior part and by bone 
in the inferior part). This study focused on the carti- 
laginous portion for 3 reasons. First, the LVPM lies 
close to the cartilaginous portion of the ET. Second, 
in the bony and junctional portions, the LVPM is sep- 
arated from the ET lumen by bone and seems to have 
no influence on ET opening. Third, the TVPM is at- 
tached to the lateral lamina of the ET cartilage, and 
not to the petrous portion of the temporal bone. 


Three features were analyzed that may indicate 
actions of the LVPM that have the potential to open 
the ET lumen. First, the positional relationship be- 
tween the LVPM and ET cartilage was investigated 
by defining “standard points” for both the LYPM and 
the ET cartilage in each “virtual.cut” cross section 
(Fig 3). The standard point for the LVPM was de- 
fined as its center of gravity. The standard point for 
the ET cartilage was defined as follows. A line was 
constructed through the superior margin o? the ET 


lumen and perpendicular to the line connecting the. 
superior and inferior margins. Then, the points, at- 
which the line crossed the medial and the lateral mar- 


gins of the medial lamina (ML) of the ET cartilage 
were identified. Then, the midpoint of the ML was 
defined as the midpoint of the line segment connect- 
ing those points. Finally, the standard point was de- 
fined as the point on the inferior margin of the ML 
that was farthest from the midpoint of the ML. Be- 
cause the profiles of the ML of the ET cartilage vary 
among cases, these standard points provided a uni- 
form reference point for describing the positional re- 
lationship between the LVPM and the ET cartilage. 
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4 The part where LVPM shows its maximum area in its cross section 
<-> The part where TVPM attached to the lateral lamina of ET cartilage 


Fig 5. Histogram demonstrating results of present study. 
Described in each case is anatomic relationship (types A 
and B1 of LVPM to ET cartilage, part in which LVPM 
shows its maximum area in its cross section in cartilagi- 
nous portion of ET, and part in which TVPM attaches to 
ET cactilage. Cartilaginous portion of all cases is nor- 
malized in 100 cross sections. 


2 


The second feature reflecting a potential influence 
of the LVPM on ET opening is the horizontal (me- 
dial-lateral) relationship between the LVPM and the 
ML of the ET cartilage. When the center of gravity 
of the LYPM was located lateral to the standard point 
of the ML of the ET cartilage, this anatomic relation- 
ship was classified as a “type A” relationship. On the 
other hend, when the center of gravity of the LVPM 
was located medial to the ML standard point, it was 
classified as a “type B” relationship (Fig 4). 


Since the contraction of the LVPM is isotonic, the 
LVPM should exert the greatest influence on ET 
opening function at the site at which the LVPM has 
its largest cross-sectional area along the cartilaginous 
portion of the ET. Therefore, the third feature of in- 
terest was the cross-sectional area of the LVPM in 
every v-rtual cross section through the cartilaginous 
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Fig 6. Mean cross-sectional area of LVPM in both child 
and adult groups within areas of cartilaginous portion of 


ET. Cartilaginous portion of all cases is normalized in 
100 cross sections. 


portion of the ET. In addition, the shortest distance 
between the ET cartilage and the margin of the LVPM 
was measured in the section in which the LVPM 
showed its maximum cross-sectional area. 


Since actions of the TVPM play a major role in 
opening the ET lumen, it is important to understand 
how the LVPM may act in concert with the TVPM. 
Thus, the site of the attachment of the TVPM to the 
LL of the ET cartilage was compared to the location 
of the features of the LVPM described above. Ac- 
cording to Matsune et al,!! the attachment of the 
TVPM into the ET cartilage is defined as incorpora- 
tion of the tendon of the TVPM into the perichon- 
drium of the tip of the LL of the ET cartilage. The 
location and range of the attachment of the TVPM 
to the LL of the ET cartilage were defined and mea- 
sured within the virtual cross sections of the carti- 
laginous portion of the ET by the same method. !? 


RESULTS 


The loci of the type A and type B relationships 
between the standard points of the LVPM and the 
ML of the ET cartilage are shown in Fig 5, and are 
based on 100 “virtual cut” cross sections through the 
cartilaginous portion of the ET. This Figure also dis- 
plays the site of the maximum cross-sectional area 
of the LVPM and the location of the tendinous at- 
tachment of the TVPM into the LL of the ET carti- 
lage. 


In each case, the anterior aspect of the cartilagi- 
nous portion of the ET always showed a type A rela- 
tionship to the center of the LVPM (Fig 5). The loca- 
tion of areas with a type B relationship varied in each 
case, but they were never found in the anterior carti- 
laginous portion of the ET. The locations of areas 
with type A or B relationships did not appear to dif- 
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Fig 7. Photomicrograph of anterior cartilaginous portion 
of ET and its surrounding structures, obtained from 17- 
year-old girl (case 7). This section shows maximal cross- 
sectional area of LVPM (LVP). Relationship between ML 
of ET cartilage and LVPM is type A. Tensor veli palatini 
muscle (TVP) is not attached to LL of ET cartilage in 
this site. 


fer between children and adults. However, 3 elderly 
female subjects (cases 11, 12, and 13) showed only 
a type A relationship (Fig 5). 


Figure 6 shows the cross-sectional area of the 
LVPM as a function of normalized distance along 
the ET. Analysis of variance (between-groups fac- 
tor, child versus adult; within-groups factor, distance 
along ET axis) of the cross-sectional area from ev- 
ery fifth “virtual section” showed a significant main 
effect between groups (F(1,12) = 13.4, p < .01) that 
indicated a greater cross-sectional area in adults. 
There was also a significant effect of distance along 
the ET (F(20,240) = 19.78, p < .01) that indicated that 
the cross-sectional area of the LVPM varied signifi- 
cantly along the length of the ET, such that it was 
greater in the half of the ET that was proximal to the 
pharyngeal orifice. There was also a significant in- 
teraction effect (F(20,240) = 3.13, p < .05) reflecting 
a steep, monotonic decline in the cross-sectional area 
between approximately the midpoint and the tym- 
panic end of the cartilaginous portion of the ET in 
the adult specimens. 


In the majority of the individual cases, the site of 
maximum cross-sectional area of the LVPM was also 
located in the anterior 50% of the cartilaginous ET 
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(Fig 5), and there was no significant difference in 
location (in percent of the length of the cartilaginous 
ET) between children and adults (p > .1, t-test and 
Kruskal-Wallis test). The shortest distance between 
the margin of the ML of the ET cartilage and that of 
the LVPM at the site at which the LVPM shows its 
maximum cross-sectional area was, on average, 0.44 
+0.16 mm in the child group and 1.02 + 0.58 mm in 
the adult group. Figure 7 is a photomicrograph of 
the anterior cartilaginous portion of the ET at the 
site at which the LVPM had its maximum cross-sec- 
tional area (case 7). 


The locus of the insertion of the TVPM into the 
LL of the ET cartilage ranged from the middle to the 
posterior part of the cartilaginous portion of the ET. 
In a few cases (cases 4, 6, 7), the anterior end por- 
tion extended to the anterior cartilaginous portion 
(Fig 5). 


DISCUSSION 


The LVPM is a palatal muscle that is almost par- 
allel to the inferior margin of the ET lumen and car- 


Fig 9. Schematic drawing of ET and its 
surrounding structures in middle to pos- 
terior part of cartilaginous portion of ET, 
with and without contraction of TVPM 
and LVPM. Within area in which type B 
relationship is present, LVPM has small 
cross-sectional area, and TVPM is at- 
tached to LL of ET cartilage. Contraction 
of TVPM pulls LL inferolaterally, away 
from ML of ET cartilage. Action opens 
major part of ET lumen (L), including 
its narrowest portion (especially its up- 
per half portion). On other hand, eleva- 
tion and rotation of ML of ET cartilage 
with contraction of LVPM is limited by 
presence of both type B relationship and 
small cross-sectional area of LVPM. 





Without contraction of TVPM and LVPM 


Fig 8. Schematic drawing of ET and its surround- 
ing structures in anterior part of cartilaginous por- 
tion of ET, with and without contraction of LVPM. 
Within this area, in which type A relationship is 
present, TVPM is not attached to LL of ET carti- 
lage. LVPM and ML of ET cartilage are in close 
proximity. With contraction of LVPM, ML of ET 
cartilage can be elevated and rotated superior-me- 
dially to open ET lumen (L). 


With contraction of LVPM 


tilage. However, the LVPM is attached to neither the 
ET nor the ET cartilage, as its origin is the inferior 
aspect of the petrous bone and its insertion is the 
soft palate. Although it is recognized universally that 
another palatal muscle, the TVPM, participates ac- 
tively in opening the ET, the role of the LVPM in ET 
function has been the subject of controversy. Accord- 
ing to the experimental report of Honjo et al? and the 
clinical report of Finkelstein et al,? the TVPM is 
thought to be the only dilator for ventilatory func- 
tion of the ET. It is also assumed that contraction of 
the TVPM pulls the LL inferior-laterally (away from 
the medial lamina of the ET cartilage) by a hinge- 
like action!2.!3 that opens the upper portion (roof) of 
the ET lumen.!4 However, several morphological 
studies+> have suggested that the LVPM is a dilator 
of the ET. In particular, isotonic contraction of the 
LVPM would rotate the ML of the ET cartilage su- 
perior-medially to open the ET lumen. Swarts and 
Rood,® using a morphometric method on 15 normal 
adult ETs, also reported that the LVPM might assist 
in ET opening in the anterior portion of the ET. Fur- 
thermore, a recent endoscopic study! suggested that 


Superior 


Medial Lateral 


Inferior 





With contraction of TVPM and LVPM 
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the LVPM, rotating the ML medially, might dilate 
the anterior cartilaginous portion of the ET. 


This study revealed some new anatomic features 
of the anterior part of the cartilaginous portion of the 
ET. First, the LVPM and the ML of the ET cartilage 
always show a type A relationship, such that the cross- 
sectional center of the LVPM is lateral to the ML of 
the ET cartilage. Second, the maximum cross-section- 
al area of the LVPM is found in this region. Third, 
the LVPM and the ET cartilage are in close proxim- 
ity. Finally, the TVPM is not attached to the LL of 
the ET cartilage in the anterior 25% of the cartilagi- 
nous portion of the ET. When the LVPM contracts 
to elevate the soft palate, it presumably pushes the 
ML of the ET cartilage superior-medially (ie, an ele- 
vation and rotation of the ML). It is suggested that 
the LVPM plays an important role in opening the ET 
in this region that lacks a TVPM insertion (Fig 8). 
On the other hand, it is suggested that the middle to 
posterior cartilaginous portion is opened primarily 
by the TVPM, because 1) the TVPM is attached to 
the LL of the ET cartilage and 2) the elevation and 
rotation of the ML of the ET cartilage are limited in 
this region by the presence of both a type B relation- 
ship and a small cross-sectional area of the LVPM 
(Fig 9). Thus, our anatomic findings imply that the 
TVPM is the primary muscle to open the middle to 
posterior two thirds of the cartilaginous portion of 
the ET and that the LVPM may participate in open- 
ing only the most anterior cartilaginous portion of 
the ET lumen by acting to both elevate and superior- 
medially rotate the ML of the ET cartilage. In fact, 
the TVPM appears to be the only muscle that can 


- open the narrowest part of the ET lumen. The 3 old- 


est subjects had only a type A relationship through- 
out the cartilaginous portion of the ET. This finding 
may indicate that the LVPM may be more important 
for opening the ET lumen in elderly patients. Since 





the maximal cross-sectional area of the LVPM is lo- 
cated in the anterior part of the cartilaginous portion 
of the ET in these cases, its contraction may have 
little effect on the lumen of the middle and posterior 
parts of the cartilaginous portion of the ET. How- 
ever, it may still support ET opening that is prima- 
rily due to contraction of the TVPM. 


The patterns of activity within the TVPM and the 
LVPM are important factors in understanding the po- 
tential roles of these muscles in ET function. Honjo 
et al,!©using a cineroentgenographic and electromyo- 
graphic method with monkeys, reported that the ac- 
tivity of the LVPM persisted during dilation of the 
pharyngeal orifice of the ET, even after the end of 
the TVPM activity. This persistent contraction of the 
LVPM raises the possibility that the anterior carti- 
laginous portion of the ET lumen (especially its floor 
portion) may remain open after relaxation of the 
TVPM. Hence, the anterior cartilaginous portion may 
remain open (by tonic contraction of the LVPM) af- 
ter the middle to posterior cartilaginous portion is 
closed by relaxation of the TVPM, and thus could 
generate a pumping action in the ET! from the tym- 
panic side to the pharyngeal side. In particular, a tonic 
contraction could help maintain patency of the tall, 
slitlike lumen!’ in the anterior part of the cartilagi- 
nous portion of the ET, and thus would assist in the 
clearance function of the floor region. !4 


Finally, any study that involves histologic process- 
ing of specimens may produce artifacts such as shrink- 
age of the sections during processing.!8 However, it 
should be noted that tissue shrinkage would not af- 
fect the positional relationship between the LVPM 
and the ET cartilage, the location in which the LVPM 
has its maximal cross-sectional area, or the location 
of the insertion of the TVPM on the LL of the ET 
cartilage. 
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VIDEOSTROBOLARYNGOSCOPY OF MUCUS LAYER DURING VOCAL 
FOLD VIBRATION IN PATIENTS WITH LARYNGEAL 
TENSION-FATIGUE SYNDROME 
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The mucus layer on the vocal folds was examined by videostrobolaryngoscopy in patients with laryngeal tension-fatigue syn- 
drome, a chronic functional dysphonia due to vocal abuse and misuse. Besides the findings in previous reports (such as abnormal 
glottal closure, phase or amplitude asymmetry, and the irregular mucosal wave), the vocal folds during vibration had an uneven 
mucus surface. The occurrence of an uneven mucus layer on vocal folds was significantly greater in subjects with this voice disorder 
(83% or 250 of 301 patients in this series) than in those without voice disorders (18.5% or 5 of 27). The increase of mucus viscosity, 
mucus aggregation, and the formation of rough surfaces on the vocal folds alter the mechanical properties that contribute to vibration 
of the cover of the vocal folds, and thereby worsen the symptoms of dysphonia in patients with laryngeal tension-fatigue syndrome. 


KEY WORDS — functional dysphonia, mucus, stroboscopy, vocal fold vibration, voice. 


It is well known that the final stage of vocal fold 
vibration is a mechanical phenomenon.! Vibration 
of vocal folds is confined mainly to the “cover” of 
the vocal fold, which has a unique multilayered struc- 
ture.” The cover includes a stratified squamous epi- 
thelium (the mucosa) with a surface layer of mucus 
and Reinke’s space. The quality of the voice depends 
largely on the mechanical characteristics of the cover, 
eg, evenness, viscosity, and elasticity of the tissues.3 
Any changes in the mechanical properties that hinder 
the production of a regular mucosal wave will per- 
turb the sound wave and aggravate the symptoms of 
dysphonia.3 


In clinical observations, there are chronic voice 
misusers and abusers who have dysphonia without 
overt macroscopic lesions on the vocal folds.47 Ac- 
cording to Koufman and Blalock,’ more than 50% 
of functional dysphonia patients fall into this cate- 
gory. They named this type of voice disorder “laryn- 
geal tension-fatigue syndrome.” In their opinion, the 
voices of the patients with this disorder are dispro- 
portionately poor in comparison to the laryngeal find- 
ings. 


Empirically and theoretically, the mucus layer on 
the vocal folds plays an important lubrication role in 
voice production.®-!! Reports on the effects of the mu- 
cus layer on the symptoms of dysphonia have been 
very limited. Videostrobolaryngoscopic examination 
of dysphonic patients has become a standard method 
used in voice clinics,!2"4 discovering many causes 





of dysphonia that were obscure at one time. In this 
study, we reviewed the videostrobolaryngoscopic re- 
cordings of patients with laryngeal tension-fatigue 
syndrome and focused on the changes in the mucus 
layer covering the vocal folds during vibration. To 
clarify the possible effects of the laryngeal mucus on 
dysphonia, we compared the results with those from 
subjects without voice disorders. 


MATERIALS AND METHODS 


The videostrobolaryngoscopic recording instru- 
ments used in this study included a stroboscopic light 
source (B&K 4914),a CCD camera (ELMO CN401E) 
mounted with a 60-mm zoom lens (Nagashima TVF: 
60 mm), a 70° laryngeal telescope (Nagashima SFT- 
I), and an S-VHS video recorder. The subjects were 
seated leaning forward with the chin up (ie, in the 
sniffing position) for the examination. The procedure 
was performed with or without local anesthesia (8% 
lidocaine hydrochloride sprayed on the posterior pha- 
ryngeal wall), depending on the subject’s gag reflex. 


Videostrobolaryngoscopic records of 301 cases of 
laryngeal tension-fatigue syndrome, diagnosed at our 
voice clinic over the past 8.5 years (July 1991 to De- 
cember 1999), were reviewed retrospectively. There 
were 68 male and 233 female patients ranging from 
18 to 66 years old. Among them were 25 professional 
singers (8.2%), 157 teachers (52.2%), 92 salesper- 
sons (30.6%), and 27 housewives (9.0%). All of them 
had a history of prolonged voice overuse for more 
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Fig 1. Normal. Thin, film-like mucus layer on vocal folds is invisible in resting position (A) and during vibration (B) on 
videostrobolaryngoscopy. 


than 3 months and suffered from dysphonia for more 
than 2 months. At the time of their visit to our voice 
clinic, none of them had symptoms or signs of pulmo- 
nary or cardiac diseases. None had overt organic le- 
sions on their vocal folds. A diagnosis of laryngeal 
tension-fatigue syndrome was made on the basis of 
medical history and the clinical findings in the lar- 
ynx. Patients with a history of acid reflux or dyspho- 
nic symptoms that began immediately after an up- 
per respiratory tract viral infection were excluded 
from the study. The videostrobolaryngoscopic exami- 
nation was performed under local anesthesia in 180 
patients and without anesthesia in 121 patients. 

For the control study, 27 high school teachers with- 
out apparent symptoms of dysphonia underwent the 
same examination. There were 18 female and 9 male 
subjects ranging from 25 to 46 years old. The exam- 
ination was performed in the early morning before 
going to class, but at least 2 hours after awakening. 
All of the subjects were encouraged to drink as much 
water as possible and not to eat breakfast or drink 
milk before the examination. The procedure was car- 
ried out under local anesthesia in 2 cases and with- 
out anesthesia in 25 cases. 

The records were reviewed twice separately. The 
appearance of the mucus layer on the vocal folds dur- 
ing vibration, in particular, was noted. Statistical anal- 
yses were performed with the SAS System for Micro- 
soft Windows (version 6.12, SAS Institute Inc, Cary, 
North Carolina) statistical software. Fisher’s exact 
test (2-sided) was used for the comparison of the pro- 
portions of categorical variables between different 
groups. 


RESULTS 


Mucus forms a thin film on the surface of the vo- 
cal folds. The smooth mucus layer was normally not 
visible by videostrobolaryngoscopy (Fig 1). In this 


series, the mucus was not identified in 51 of the 301 
patients (17%). We found in 250 patients (83%) that 
the mucus surface on the vocal folds was uneven ei- 
ther during vibration or in the resting position. The 
visible, rough mucus layer could be classified into 3 
types. Type 1 occurred in 152 patients (61%) in whom 
the mucus layer on the vocal folds could not be visu- 
alized in the resting position, but in whom a rough 
mucus surface appeared during vibration, especially 
in the opening phase (Fig 2). The mucus sometimes 
was sticky enough to form | or 2 bridging threads 
between the two vocal folds. When the vibration 
stopped and the vocal folds separated, the mucus film 
became smooth again. The second type was found 
in 26 cases (10%). As in the first type, the mucus 
layer was not visible in the resting position. After 
vibration started, some tiny mucus bubbles accumu- 
lated along the free margins of the vocal folds, espe- 
cially at the anterior commissure or at the junction 
between the anterior one third and the posterior two 
thirds of the vocal folds. These whitish, bubbly ac- 
cumulations resembled vocal fold nodules (Fig 3); 
however, coughing could easily remove them. In the 
third type, mucus lumps were found on the surface 
of the vocal folds before or during vibration (Fig 4). 
The mucus lumps acted as masses on the vocal folds 
and appeared to separate the vocal folds during vi- 
bration. There were 72 cases (29%) in this group. 

The distribution among the 3 uneven mucus layer 
types is shown in the Table according to whether the 
patients were examined with or without local anes- 
thesia. The difference between these two groups has 
weak statistical significance (p = .030). The overall 
percentage of uneven mucus cases was higher in the 
patients with local anesthesia, especially among those 
identified as having type 1 mucus. 

In the control group, only 5 of the 27 subjects 
(18.5%) showed an uneven mucus layer during vo- 
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surface appeared, especially in opening phase. 


cal fold vibration. Of the 5, 2 had type 1 mucus, and 
3 had type 2. The distribution differences between 
the control subjects and the patients with laryngeal 
tension-fatigue syndrome, whether or not local an- 
esthesia was used, were all statistically significant 
(p < .0001). 


DISCUSSION 


Functional voice disorders occur often without 
gross organic lesions on the vocal folds. Most are 














Fig 2. Type 1 (see text). A) Mucus layer could not be identified clearly in resting position. B) During vibration, rough mucus 


diagnosed as laryngeal tension-fatigue syndrome, !5 
which is mainly caused by long-term vocal overuse 
and excessive laryngeal muscle tension. Chronic vo- 
cal overuse is one of the main criteria used to estab- 
lish the diagnosis of laryngeal tension-fatigue syn- 
drome. In this study, we tried to exclude dysphonic 
patients with an apparent history of acid reflux lar- 
yngitis or viral infection, to minimize the possibility 
of superimposing these causes of dysphonia. How- 
ever, the absence of an acid reflux history (an im- 


Fig 3. Type 2. A) During vibration, some whitish mucus bubbles 
accumulated along free margins of vocal folds, especially near 
anterior commissure. B) Whitish mucus bubbles stuck at junc- 
tion between anterior one third and posterior two thirds of vo- 
cal folds, with aggregates of these bubbles resembling vocal 
fold nodules. C) Mucus bubbles could be cleared easily by 
coughing. 
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Fig 4. Type 3. A) In resting position, sticky mucus was found on surface of vocal folds. B) During vibration, mucus lumps 
acted as masses on vocal folds. 


portant diagnostic criterion) cannot completely rule 
out the possibility of reflux, and it is possible that 
reflux was present in some patients and controls.!® 


Vibrating vocal folds periodically interrupt the glot- 
tal airflow to form acoustic signals that are perceived 
as voice.!’ The final stage of voice production is a 
mechanical phenomenon.! Mechanical factors that 
increase turbulence in the glottal airstream will wors- 
en voice quality.* Videostrobolaryngoscopic exami- 
nation helps clarify some of the factors that cause 
functional voice disorders, such as laryngeal tension- 
fatigue syndrome. Abnormal glottal closure, phase 
or amplitude asymmetry, and an irregular mucosal 
wave are common findings in patients with functional 
dysphonia.'*-!4 Clinical reports describing the ap- 
pearance of mucus on the vocal folds during vibra- 
tion have been very limited.®!4 Woo et al,!4 in their 
stroboscopic examination of hoarse patients, reported 
only 2 cases of mucus retention in the absence of 
lesions in the larynx. 


The mucus layer on the surface of the vocal folds 
(part of the respiratory mucus) plays an important 
role in lubrication during phonation.®:!° Mucus se- 
creted by the bronchi and trachea is transported by 


CASE DISTRIBUTION 


Local No 
Anesthesia? Anesthesia? Control: 
Mucus Manifestation No. % No. % No. % 














Mucus not visible 24 13.3 27 223 22. 815 
Type 1 98 54.4 54 44.6 2 7.4 
Type 2 l1 6.1 15 12.4 3 11i 
Type 3 47 26.1 25 20.7 0 0 
Total 180 100 121 100 27 100 

Type | — rough mucus surface in opening phase; type 2 — bubble 


accumulation during vibration; type 3 — formation of mucus lumps. 
p = .030 for a versus b. 
p < .0001 for a versus c and for b versus c. 





the ciliary activity of the respiratory epithelium to the 
larynx.'* In addition, secretory glands in the larynx 
secrete mucus onto the surface of the vocal folds.?:!! 
If excessively high, the viscosity of the mucus on the 
vocal folds appears to change the mechanical prop- 
erties of the cover and hamper mucosal vibration. 
This finding has been verified experimentally in ex- 
cised canine larynges.!° 


In addition to the findings mentioned above that 
may result in dysphonia, our retrospective review of 
videostrobolaryngoscopic examinations of patients 
with laryngeal tension-fatigue syndrome revealed that 
more than 80% of patients in this series had an un- 
even mucus layer during vocal fold vibration. The 
difference between groups with local anesthesia and 
without anesthesia was weakly significant. Our re- 
view revealed that minor irritants like local anesthet- 
ics occasionally cause minor changes in the appear- 
ance of the mucus layer on the vocal folds. We found 
statistically highly significant differences between 
the results found in subjects without (18.5%) and in 
those with voice disorders (whether or not local an- 
esthesia was used). Normally, the viscosity of mu- 
cus is thin enough to preclude detection with video- 
strobolaryngoscopy. In our observations, sometimes 
the mucus on both folds stuck together to form a 
rough surface during the opening phase, characteriz- 
ing the first type of rough mucus layer, and some- 
times it was sticky enough to form lumps on the vo- 
cal folds. characterizing the third type. Judging from 
its appearance, the viscosity of mucus in these 2 types 
should be much higher than normal, although higher 
viscosity has not been proven. High-viscosity mu- 
cus would reduce the mucosal wave during phona- 
tion.!0 The mucus layer serves as an interface be- 
tween the vocal folds and the airflow through the 
glottis. It has been well established in rheology!® that 
airflow over a rough surface will cause turbulence 
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in the current, and well proven in mechanics?° that 
masses added to a vibrator (a vibrating surface) will 
decrease the amplitude and frequency of vibration. 
It appears likely that rough, thick mucus on the sur- 
face of the vocal folds may cause turbulence in pas- 
sage of the glottal air and disturb the voice. All 3 types 
of rough mucus layer observed in this series mani- 
fested as surface irregularity during phonation. In 
type 3 cases, there were persistent mucus Jumps on 
the vibrating vocal folds. The mucus masses may re- 
duce the frequency and amplitude of vocal fold vi- 
bration, causing dysphonic symptoms to worsen in 
patients with laryngeal tension-fatigue syndrome, al- 
though this effect has not been proven in vivo. 


We found no evidence to show whether the abnor- 
mal mucus was from the tracheobronchial tree or 
from mucus-secreting glands in the larynx. In histo- 
logic studies of human laryngeal glands, Sato and 
Hirano? and Nielsen!! postulate that inflammation 
of the vocal folds may alter the morphology of laryn- 
geal mucus glands and change the components in 
the mucus. The component change may increase the 
viscosity of mucus and cause it to congeal and form 
an uneven interface between the vocal folds and the 
glottal airflow, and thereby worsen phonation func- 
tion in patients with laryngeal tension-fatigue syn- 
drome. 


Empirically, mucolytic agents and hydration can 
improve vocal function in patients with functional 


dysphonia.? It is believed that those therapies de- 
crease the viscosity of the mucus and ameliorate the 
evenness of the mucus layer on the vocal folds. They 
are recommended not only in functional voice disor- 
ders, but also in organic voice disorders. It is impor- 
tant to clarify by future studies how laryngeal mucus 
changes in voice disorders, and in response to medi- 
cation, voice therapy, and other factors. 


CONCLUSION 


The mucus layer on the vocal folds plays an im- 
portant role in lubrication during phonation and af- 
fects the quality of the voice. In this series of video- 
strobolaryngoscopic records of patients with laryn- 
geal tension-fatigue dysphonia, we found that 250 
of 301 cases (83%) had an uneven mucus surface on 
the vocal folds during vibration or in the resting po- 
sition. We found a statistically significant difference 
between the results from subjects with and those with- 
out voice disorders (18.5% or 5 of 27), and a small 
but statistically significant difference between sub- 
jects with and without topical anesthesia. We sug- 
gest that a high viscosity of mucus, mucus aggrega- 
tion, and a rough surface on the vocal folds exacer- 
bate the symptoms of dysphonia. According to the 
results of this series, the empirical use of mucolytic 
agents and hydration therapy might be efficacious 
treatment. The changes in mucus components and 
the effects of medication and other treatment on mu- 
cus in voice disorders should be studied. 
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ELECTRONIC ANALYSIS OF INTRINSIC LARYNGEAL MUSCLES IN 
CANINE SOUND PRODUCTION 
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This study explores the relationship between voice production and intrinsic laryngeal muscle (ILM) activities as expressed by 
orderly recruitment of their specific motor units. In 5 dogs, both the recurrent laryngeal nerve (RLN) and the vagus nerve (cranial 
nerve X) were stimulated via tripolar electrodes with stimulating frequencies (Fs) of 10 to 60 Hz and 0 to 7 mA during application of 
symmetric 600 Hz, 7 to 0 mA blocking currents. The fundamental frequency (Fo) and the intensity (1) of sounds generated by tracheal 
insufflation of humidified air were recorded while electromyograms of the cricothyroideus (CT), thyroarytenoideus (TA), and poste- 
rior cricoarytenoideus (PCA) were obtained via surface electrodes. Contractions of the CT were concurrently induced by stimulating 
the superior laryngeal nerve (SLN). The recruitment rates were highly specific and were affected by which nerve was stimulated. For 
the RLN, PCA ramping was lowest for Fs of <50 Hz. For Fs of 10 to 30 Hz, the recruitment rate of the TA was significantly steeper 
than that for the other ILMs, and the CT had the highest rate for Fs of 40 to 50 Hz. Conversely, for the vagus nerve, PCA recruitment 
was highest for Fs of 230 Hz. The average Fo was significantly higher with the RLN than with the vagus nerve. When the TA 
recruited faster than the CT (ie, via the RLN, but not the vagus nerve), the Fo was higher. While only CT ramping was significantly 
related to changes in sound intensity, there was a trend toward a decrease when PCA ramping was higher than CT ramping, as 
occurred when only the vagus nerve was stimulated. Stimulation of the SLN always increased Fo and loudness. We conclude that 
changes in Fo occur mainly through RLN-mediated CT and TA contraction. Loudness is controlled by the CT. The PCA exerts 
reciprocal coupling on both functions via the vagus nerve, and they are boosted across the board by the SLN. These findings may 
allow artificial manipulation of voice. 


KEY WORDS — artificial voice production, frequency analysis, loudness analysis, orderly recruitment, selective intrinsic la- 
ryngeal muscle contraction. 


BACKGROUND AND JUSTIFICATION (cranial nerve X), which thus ultimately controls all 
ILMs. Unfortunately, little is known about the fine 
distribution of larynx-bound neurofibers to the indi- 
vidual ILMs or to their anastomoses within the lar- 


ynx. 


Voice production results from complex interactions 
between intrinsic laryngeal muscles (ILMs) acting 
in combination to effect vocal fold adduction, abduc- 
tion, and tension to vary the resistance of the glottis 


over the expiratory air column originating in the lungs 
(myoelastic theory!). Adduction is primarily due to 
contraction of the thyroarytenoideus (TA), whereas 
abduction is exclusively controlled by the posterior 
cricoarytenoideus (PCA). The cricothyroideus (CT) 
generates tension and a certain degree of additional 
adduction over the narrowed glottis.2 The process of 
vocal modulation is a result of the fine-tuned antago- 
nistic relationships between the TA and the PCA, 
which are both innervated by the recurrent laryngeal 
nerve (RLN). Although the CT is the only muscle in- 
nervated by the superior laryngeal nerve (SLN), both 
the RLN and the SLN stem from the vagus nerve 


Individual ILM contributions to phonation have 
been difficult to outline. Simple vocal fold observa- 
tion contributes little in analyzing the complexities 
of laryngeal muscle interplay, which responds to poor- 
ly defined servomechanisms in the brain stem.‘ In 
vivo electromyographic (EMG) assessments may of- 
fer useful information, but lack statistical power be- 
cause of the scarcity of volunteers and difficulties in 
needle electrode placement during simultaneous re- 
cording of data.>*- Artificially induced phonation in 
animals under general anesthesia does not have the 
benefit of voluntary input.?:!° Theoretically, each ILM 
could be directly assessed by stimulating peripherally 
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located branches of the RLN,!! but concomitant ex- 
citation of anastomotic fibers serving other effectors 
can never be excluded. 


One possible way to answer these critiques and to 
properly appraise nerve-muscle functional continu- 
.ity is to consider individual motor unit architecture, 
since it varies from one unit to another. In a previous 
study, we showed that each group of ILMs could be 
specifically defined by means of an electronic circuit 
specially designed to produce orderly, physiological 
recruitment.!2 In the current research, we hypothe- 
sized that the individual roles of key glottic muscles 
in the production of sound could be defined on the 
basis of the unique attributes of different recruitment 
rates. We thus artificially reproduced sound by trans- 
tracheal injection of a constant flow of humidified 
air forced cranially through the variable laryngeal 
resistor during the natural, orderly recruitment of its 


constituent ILMs. We recognize that this situation 


may differ from normal physiological voice produc- 
tion, in which airflow is variable, and that canine 
data may not necessarily be extrapolated to the hu- 
man situation, in which in vivo verification is in any 
case difficult. This phase I study compares the rates 
of individual EMG compound muscle action poten- 
tial (CMAP) progressions to the fundamental fre- 
quency (Fo) and intensity (I) of the sound recorded 
at the animals’ mouths. 


It is ironic that paralyzed ILMs have been easier 
to study than their healthy counterparts, possibly be- 
cause they may be specifically targeted with foreign 
motor implants borrowed from other, noninvolved 
territories. We previously achieved voice manipula- 
tion in dogs with sectioned RLNs by stimulating 2 
distinct conduits respectively reinnervating the TA 
and PCA while airflow was artificially produced to 
mimic pulmonary expiration.!3 Sound envelopes typ- 
ical for canine barks (330 to 600 Hz!3.!4) were in- 
duced by antagonistic TA-PCA coupling primed over 
a background of restored baseline tone levels. A re- 
turn to normal intermuscular relationships abolished 
by the paralytic process is logical and, indeed, a pre- 
requisite!> for the restoration of the dynamic inter- 
play typical of the rehabilitation processes of any stri- 
ated muscle system. This concept is a corollary to 
the artificial restoration of missing myotatic reflexes 
that ordinary nerve-muscle implantations have been 
unable to achieve.!© While the “reinnervated laryn- 
geal option” may offer promise after complete pa- 
ralysis, it behooves the practitioner to consider that 
most laryngeal dystonias involve an organ system 
that remains anatomically and functionally (albeit im- 
perfectly) connected to its neural grid. Thus, delib- 
erately induced paralysis for the rehabilitation of such 
dystonias appears excessive in light of the difficul- 


ties that may arise from potentially unsatisfactory re- 
innervation, the necessity for separate agonistic drives 
to achieve reciprocal coupling, and the cumbersome 
nature of multiple perineural electrode placements. 


Histochemical studies show that fast muscles such 
as the TA are composed mainly of type II myofibers 
(with only a few type I) as part of small motor units, 
while the contrary is true for slow muscles such as 
the PCA.!7 Interestingly, there is a direct correspon- 
dence between axonal diameters and the number of 
myofibers subtended within the constituent motor 
units.!8 It has also been established that larger axons 
are more excitable than their thinner counterparts. !9 
Thus, standard bipolar stimulation of a nerve com- 
posed of axons of various diameters such as the RLN 
or the vagus nerve should result in premature con- 
tractions of the more excitable muscles (such as the 
PCA), since they contain higher numbers of larger 
motor units and axons. Conversely, the less excit- 
able, faster muscles subtended by thinner axons (eg, 
the TA) cannot be expressed under standard stimu- 
lating conditions. In fact, the bipolar stimulation para- 
digm is in contradistinction to natural muscular acti- 
vation patterns, in which muscles composed of small- 
er motor units are normally activated before their 
larger counterparts (the size principle of Henneman 
et al20). ; 


With the above difficulties in mind and in order to 
study the muscles in their natural state, we previous- 
ly reproduced orderly recruitment patterns using a 
specially designed electronic circuit.2! The recorded 
EMG ramping curves were expressed as averaged 
CMAPSs (in millivolts) via surface electrodes. By 
avoiding direct muscle implantation, surface elec- 
trodes preclude fragmentary evaluation and provide 
the observer with a global assessment of the whole 
muscle’s activities. The CMAP values were plotted 
over time as the intact RLN was stimulated via a tri- 
polar electrode cued by a dual circuit. Contractions 
elicited by the first (bipolar, stimulating) circuit via 
the lone cathodeand the proximal anode were blocked 


through high (600 Hz) frequencies?! by a second cir- 


cuit involving the distal anode, thus achieving tripolar 
stimulation as originally described by Zhou et al22 
and Baratta et al? in feline gastrocnemius prepara- 
tions. Complete muscular blockage was expressed as 
flat EMG CMAP lines for each ILM. Orderly recruit- 
ment progressions were graphically revealed as block- 
ing currents were lifted from these baselines. The 
diverse ramping slopes thus obtained measured EMG 
CMAP (in millivolts) progression quotients over time 
(milliseconds). As intuitively expected, the TA was 
found to have a steeper rate of recruitment as com- 
pared to the PCA, while the lateral cricoarytenoideus 
(LCA) displayed intermediate values.!2 This method 
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Fig 1. Block diagram of study. Larynx and trachea are 
exposed under general anesthesia. Recurrent laryngeal 
nerve (RLN), superior laryngeal nerve (SLN), and va- 
gus nerve (X) are isolated bilaterally. RLN and X (but 
not SLN) are fitted with tripolar electrodes linked to dual 
stimulation/blocking circuit (circled arrow) producing dis- 
tal anodal blockage of currents generated upstream via 
proximal anode and medial cathode. Spherical-shapec con- 
tact electromyographic (EMG) surface electrodes pick up 
resulting compound muscle action potentials (CMAPs). 
Vocal fold adduction (inset) acts as variable resistor while 
humidified air is blown cephalad via second endotracheal 
tube. Microphone at mouth records sound envelope thus 
produced. Recruitment rates of individual muscles (lower 
screen) and sound pressure levels (upper screen) are re- 
corded and compared. PCA — posterior cricoarytenoi- 
deus, TA — thyroarytenoideus, CT — cricothyroideus. 


secured for the larynx complete independence in the 
control of the motor unit firing rates based on their 
orderly recruitments.?+ 


While the muscular actions obtained via orderly 
recruitment are an approximation of what occurs in 
vivo, the latter has never been observed, because stan- 
dard electrodes do not recruit motor units in a physi- 
ological fashion. The use of the blocking circuit thus 
appears to the investigators to be the clearest way to 
separate the actions of the individual muscles in a 
living preparation. Therefore, the current research 
was designed to elucidate the relative participation 
of key ILMs and their relevant neural circuits in pho- 
nation with a view toward the eventual use of this 


information for the dynamic management of various 
laryngeal dystonias. The dynamic management of 
voice proposed in this study is in contradistinction 
to traditional static treatments of the impaired glot- 
tic resistor, in which paralysis (ie, the absence of tone) 
of the vocal fold may only be compensated for by 
medialization, and spastic states (ie, increased tone) 
may only abate via either RLN section or direct mus- 
cular injection of botulinum toxin. These interven- 
tions violate an otherwise intact nerve or the larynx 
itself, offer at best only temporary relief, may be ir- 
reversible, and are fraught with potential complica- 
tions.?4 


MATERIALS AND EXPERIMENTAL METHOD 


Five mongrel dogs weighing 15 to 20 kg were used. 
This number was determined by using a repeated- 
measures analysis of variance (ANOVA) approach 
for the various independent variables under consid- 
eration. By adherence to statistical assumptions of 
the tests performed and use of Bonferroni adjusted p 
values to examine the significance of multiple test- 
ings, this number of animals was set to detect whether 
independent variables accounted for 20% of the vari- 
ability of the response variable with at least 90% pow- 
er, as based on results of a previous study. !? 


After intravenous anesthesia with pentobarbital 
sodium, halothane and oxygen were administered 
through an endotracheal tube in the supine animals 
(Fig 1). Through a midline cervical incision, the dogs’ 
artificial airways were converted to a low tracheos- 
tomy with the distal aspect of the tube facing down- 
ward well above the carina for effective bilateral pul- 
monary ventilation. A second endotracheal tube, des- 
tined to channel humidified air and placed above the 
first tracheostomy site, was directed superiorly. The 
RLN, SLN, and vagus nerve were then anatomically 
isolated on both sides and verified as eliciting muscu- 
lar contraction in the larynx by contact electrical stim- 
ulation with a disposable stimulator (0.5 mA, Vari- 
stim, Clearwater, Florida). The tongue was then sus- 
pended from a frame and the epiglottis was sutured 
to its base to expose the glottic chink without need 
to use direct laryngoscopy for appropriate observa- 
tion. After exposure of the TAs through windows cut 
on each side of the thyroid cartilage, the PCAs and 
CTs were also identified. The larynx was rotated al- 
ternately to one side and maintained in position via 
double-prong hooks under the thyroid lamina while 
suspended to the frame. This maneuver allowed ex- 
cellent ILM exposure on the opposite side. 


A tripolar electrode (Axon Engineering Company, 
Chesterland, Ohio) was then placed around each RLN 
and vagus nerve (on both sides). In addition, a 30 
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Fig 2. Spherical EMG electrodes placed over surface of 
TA (upper arrow), CT (middle arrow), and PCA (lower 
arrow). 


gauge platinum wire was looped around both SLNs. 
Pairs of spherical surface EMG electrodes (N = 3) 
similar to those used in a previous study! were placed 
to effect direct contact over each of the 3 muscles 
(TA, PCA, CT; Fig 2). The stimulation module of the 
circuit (Biomedical Corporation of America, Cleve- 
land, Ohio) produced 10 to 100 Hz, 0 to 7,000 WA, 0 
to 1,000 ms currents with a 600 Hz, 7,000 to 0 uA 
blocking envelope injected into the RLNs and vagus 
nerves. Unilateral and bilateral SLN stimulations (0.5 
to 2 mA) were superimposed in selected cases. The 
EMG probes were positioned over the surface of the 
exposed TA, PCA, and CT and linked to an Astromed 
MT95K2 multitask recorder. A low-impedance (600 
Q) Labtec DM-20SL dynamic cardioid microphone 
was placed at each animal’s mouth and connected to 
a personal computer programmed with CoolEdit ver- 
sion 96 (Syntrillium Software Corporation; Phoenix, 
Arizona) and a SciCon PC Quirer sound analyzer 
(Los Angeles, California). 


Specifically, the initial protocol called for bipolar 
currents respectively stimulating both RLNs and va- 
gus nerves (“S,” in milliamperes) at 10 to 100 Hz in 
10 Hz increments (10 measurements). The intensi- 
ties (milliamperes) were increased until saturation 
(ie, the amplitude level in millivolts on the graph at 
which no further increase in EMG CMAPs occurred 
for each ILM). The intensity of the blocking current 
(“B,” in milliamperes) was then determined by trial 
and error as the amount necessary and sufficient to 
reduce S-generated EMG CMAP amplitudes to a flat 
line. The blocking “B” current (milliamperes) was 
then linearly reduced to zero by 50 uA decrements 
until saturation recurred in each and every 3 ILMs. 
The S/B balance varied from nerve to nerve (Table 
1). Since SLN innervation is traditionally believed 
to be solely directed to the CT, this particular nerve 
was not submitted to a recruiting envelope. Rather, 





TABLE 1. STIMULATION/BLOCKING PARAMETERS 








Data Firing Recruitment 
Group Nerve (mA) (mA) Side 
Animal | 
A RLN 2.0 2.0 R 
B X 2.0 2.0 R 
C X 2.0 2.0 L 
E SLN 0.5-2.0 N/A R/L 
Animal 2 
A X 2.8 2.3 R 
B RLN 4.0 4.0 R 
C RLN 4.1 4.0 I 
D X 5.6 4.1 I 
Animal 3 
A X 3.0 3.0 R 
B RLN 10-20 Hz 1.1 1.1 R 
RLN > 30 Hz 2.5 25 
C RLN 2.3 23 L 
D X 5.0 5.0 L 
Animal 4 
A RLN 3.4 3.4 R 
B RLN 5.0 5.0 L 
C X 7.0 7.0 i 
D X 4.7 4.7 R 
Animal 5 
A RLN 1.5 3.0 R 
B RLN 1.0 2.0 L 
C X 2.7 3.7 L 
D X 2.3 3.0 R 
E SLN 0.5-2.0 N/A R 
Groups A to D are blinded in terms of nerve-muscle combinations. 
Group E relates to superior laryngeal nerve (SLN) stimulation. 
RLN — recurrent laryngeal nerve, X — vagus nerve (nerve X), 


N/A — not applicable. 





the effects of CT contractions first verified in isola- 
tion were applied over a background of overall ILM 
saturation obtained from either RLN or vagus nerve 
stimulation. The humidified air output was set to a 
steady 600 mL/min rate, in the general range fol- 
lowed by other authors?:!° to produce sound appro- 
priately audible by the human ear. 


The chart recorder speed was set at 100 mm/s (ex- 
cept for SLN stimulation, for which it was set at 25 
mm/s, since CT ramping did not need to be evalu- 
ated). Thus, 1 cm on the graph corresponded to 100 
ms in real time. Recording of data for analysis was 
blinded. The EMG CMAP amplitude peaks (milli- 
volts) were measured with an electronic Mitutoyo 
caliper (Tokyo, Japan) at 25 ms intervals (40 data 
points per second). As an example, this rate allowed, 
within an overall stimulation period of 5 seconds, 
gathering of 200 data points per stimulation frequen- 
cy (Fs) run (10 to 100 Hz). The data were then en- 
tered into spreadsheets for the purpose of analysis. 
After completion of the spreadsheets, attention fo- 
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Fig 3. Tongue suspension allows direct glottic view and 
sound recording without endoscope. 


cused on the lowest EMG CMAP amplitude (milli- 
volts) immediately preceding ramping takeoff, as 
initiated by lifting the blocking current, until satura- 
tion returned to original, bipolar stimulation levels. 
This phase usually lasted 3 seconds, which roughly 
corresponded to 180 data points for a given stimula- 
tion run. With 5 dogs having their RLN and vagus 
nerve stimulated from 10 to 100 Hz on each side, 
200 stimulation runs were planned for orderly recruit- 
ment. The data thus collected were lumped into sev- 
eral cohorts comprising individual groupings of CT, 
TA, and PCA amplitudes, either from RLN or vagus 
nerve stimulation. The sound envelope correspond- 
ing to variations in CMAP EMG amplitudes was re- 
corded via the low-impedance microphone and the 
CoolEdit program. The sample rate (number of sound 
samples recorded per second) was set at 4,800 bits 
per second to provide optimum quality of the audio 
spectrum. 


The data sound curves (N = 50) involving 10 (10 
to 100 Hz) frequency measurements (Fs) per ani- 
mal!- were stored on floppy disk. The data were ana- 
lyzed by use of time periods (seconds) coincident to 
EMG ramping, starting with initiation, until satura- 
tion. In this manner, only the window of data during 
the orderly recruitment process was considered for 
sound analyses. The FO (hertz) and loudness or in- 
tensity (decibels) were calculated from sound curves 
with PC Quirer. The Fo from fast Fourier transforms 
within the time windows was calculated at 10 ms 
intervals by setting PC Quirer for optimal display, 
considering window length settings (5 to 30 ms), step 
sizes (5 to 15 ms), frequency tracking threshold 
(+2%), calculation ranges (50 to 300 Hz), display 
ranges (50 to 300 Hz), frequency grid displays (20 
to 50 Hz), and running time grid (0 to 300 ms). The 
FO was graphed to appear as a series of data points 
superimposed over the original, generic sound curve. 
Intensity (I; decibels) was calculated by the PC Quirer 





program every 10 ms. 


The times for initiation of ramping EMG CMAP 
data were then aligned with frequency and intensity 
values simultaneously recorded from the evoked 
sound. Graphically, time “zero” was defined as the 
beginning of EMG CMAP ramping. The derivative 
of the time envelope was then displayed and quanti- 
fied for each ILM. The TA, PCA, and CT ramping 
slopes were grouped in 2 cohorts depending on the 
stimulated nerve (RLN or vagus) as TARLN and 
TAX, PCARLN and PCAX, and CTRLN and CTX. 
These data were then individually plotted against 
evoked FO and I to achieve 3-dimensional curves. 


Repeated-measures ANOVA was used to determine 
differences over the range of stimulation applied. If 
significant differences existed, pairwise comparison 
of muscles was performed and p values were adjusted 
by Bonferroni’s method. Next, the relationship be- 
tween the 3 muscles and 2 nerves with both changes 
in intensity and frequency was evaluated with re- 
peated-measures ANOVA. To determine the amount 
of variability accounted for by these factors, R2 analy- 
ses, the percentage of variability explained, were com- 
puted. R2 is a measure of the percentage of variation 
accounted for solely by the statistical model. To con- 
trast p values and R2, the significance of the p val- 
ues indicate 95% certainty that certain nerves and 
muscles are related to the measurements, and the R2 
values assess the degree to which they explain the 
outcomes. The remaining unexplained variation may 
be due to uncontrollable experimental design factors, 
or perhaps other physiological parameters yet to be 
examined such as “nerve” and “muscle” data for in- 
tra-dog or inter-dog variability. The R2 values do not 
indicate the significance that is found with p values, 
but only that a certain percentage of the variability 
has yet to be quantified. All calculations were per- 
formed with SAS version 8 software (SAS Institute 
Inc, Cary, North Carolina). 


At the completion of these experiments, the ani- 
mals were painlessly sacrificed with intravenous Pen- 
tothal. 

RESULTS 


With the exception of a total lack of response on 
one side following RLN stimulation in dog 1, all 
EMG CMAPs were evaluable and verified on the 
video monitor (Fig 3). Stimulation/blocking (S/B) pa- 
rameters are shown in Table 1. Of 19 nerve-ILM sides 
addressed, a total of 570 EMG sheets were created 
and distributed as follows: 9 RLN-ILM and 10 va- 
gus-ILM cohorts. The groups of experiments thus to- 
taled 57 (19 x 3) for all 5 dogs. Since, however, only 
10 to 60 Hz (6 Fs groups) could be retained for analy- 
sis because of poor ramping differentials above 60 


——.«— 
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TABLE 2. MULTIVARIATE ANALYSIS OF VARIANCE 
RESULTS FOR EMG AMPLITUDE RAMPINGS 











Variable p 
Time since blockage <.001 
Muscle .002 - 
Nerve _ <.001 
Stimulating frequency <.001 
Nerve-muscle interaction <.001 
Muscle-stimulating frequency interaction <.001 


EMG — electromyographic. 





Hz, the number of evaluable data sheets totaled only 
162 (9 x 3 x6). They further comprised 4 data points 
per 100 ms for an arbitrarily designated time inter- 
val of 10 seconds, ie, a total of 64,800 data points 
for RLN-ILMs, and (10 x 3 x 6) or 180 sheets and a 
total (with also 400 data points) of 72,000 data points 
for vagus-ILMs. These numbers were further reduced 
as ramping analysis specifically focused on the 3- 
second intervals separating lifting of blockage from 
saturation. This setup ultimately resulted in an enve- 
lope of 120 data points per ILM per stimulating fre- 
quency for the 3 seconds (ie, 19,440 for RLN-ILMs) 
distributed as 120 EMG peak measurements (4 per 
100 ms for 3 seconds) on 6 frequencies (10 to 60 
Hz) for 9 runs, and 21,600 for vagus-ILMs in which 
all the nerves (10 runs) were productively stimulated. 
These multiple measures increased the power of the 
study design. 


The ramping of individual nerve-muscles as ex- 
pressed after lifting of the blocking currents was high- 
ly specific to each group. Four variables were eval- 
uated for each experiment by repeated-measures 
ANOVA for their effect on EMG recruitment slopes 
(Table 2). They were 1) muscle types (TA, PCA, CT); 
2) nerve types (RLN, X); 3) stimulation frequencies 
(Fs, 10 to 60 Hz); and 4) ramping windows (0 to 
3,000 ms). Analyses indicated that the recruitment 
slopes were independently and significantly affected 
by all of those variables. The effect of Fs on EMG 
ramping was significantly different for each ILM 
(TA, PCA, CT; p < .001) and independent of stimu- 
lation frequencies (Tables 2 and 3). 


On the basis of these results, ILM muscle types 
were compared for each combination of stimulating 
frequencies and nerve types used (Table 4). When 
the RLN was stimulated with 10 Hz currents, the PCA 
recruitment slopes were significantly lower than 
those displayed by both the TA and the CT. With 10 
to 30 Hz, the TA recruitment curves were signifi- 
cantly steeper than those for the PCA, but in the range 
of 40 to 50 Hz, the CT had significantly higher re- 
cruitment rates than the other ILMs. For the vagus 
nerve, no significant differences were observed be- 
tween the 3 muscles under 10 to 20 Hz Fs (Table 5 
and Figs 4 and 5). However, for 30 Hz or higher, 
vagal stimulation produced significantly greater re- 
cruitment rates for the PCA as compared to the CT 
and TA. The PCA’s higher recruitment rates produced 
under vagal stimulation must, therefore, be contrasted 
with the low progression figures obtained with the 
RLN. 


The amount of CT ramping was the only signifi- 
cant variable related to a change in intensity (p = 
.03; Table 5). The relationship is best illustrated in 
Fig 5, which shows that no other ILMs were involved. 
The CT accounts for 33% (R2) of the variability of 


‘intensity; therefore, 67% is created by other sources, 


such as inter-dog and intra-dog variability or possi- 
bly variable proportions of dominant motor units with- 
in the ILMs. Variability in response may also be un- 
derstood as reflecting trends falling, by nature, short 
of complete architectural and histochemical homoge- 
neity. 


The average F0 produced was significantly related 
to the stimulated nerve (p = .02). The Fo was higher 
for the RLN than for the vagus nerve (average, 209.8 
+ 14.9 Hz versus 148.6 + 18.2 Hz). Since the R2 for 
this model was 55%, more than half of the variabil- 
ity in frequency could be explained by which nerve 
was stimulated. We also found that the Fo was sig- 
nificantly correlated with the difference in recruit- 
ment between the TA and the CT (p = .003). When 
the former muscle recruited faster than its counter- 
part, higher Fo values (200.0 + 73.3 Hz) were re- 


TABLE 3. DIFFERENCES IN EMG AMPLITUDE RAMPING BASED ON NERVE AND STIMULATING FREQUENCY 























Vagus Nerve Recurrent Laryngeal Nerve 
Differences Between Pairwise Muscle Differences Between Pairwise Muscle 
Stimulating Frequency (Hz) Muscles (p) Differences Muscles (p) Differences 

10 37 No differences <.001 TA > PCA; CT > PCA 
20 33 No differences <.001 TA> PCA; TA> CT 
30 .005 PCA > TA; PCA > CT 005 TA>PCA 

40 .001 PCA > TA .008 CT > PCA 

50 <.001 PCA > TA; PCA > CT 004 CT > PCA; CT > TA 
60 i <.001 PCA > TA; PCA > CT .003 PCA > TA; PCA > CT 


CT — cricothyroideus, TA — thyroarytenoideus, PCA — posterior cricoarytenoideus. 
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TABLE 4. COMPARISONS OF RECRUITMENT SLOPES 
ACCORDING TO STIMULATED NERVES 





TABLE 5. COMPARISONS OF SLOPES FOR VAGUS 
NERVE AT HIGH AND LOW FREQUENCIES 





Pairwise p Values 











Muscle Mean Slope” Cr TA PCA 
RLN 
CT 0.057 225 162 
TA 0.031 .225 .786 
PCA 0.026 162 .786 
X 
CT 0.044 .716 048 * 
TA 0.039 716 .019* 
PCA 0.073 .048* 019% 


CT — cricothyroideus, TA — thyroarytenoideus, PCA — posterior 
cricoarytenoideus. 


*Statistically significant. 


corded as compared to when the reverse (CT > TA) 
was true (164.4 + 54.8 Hz). Since overall TA ramp- 
ing exceeds that of the CT when the RLN is stimu- 
lated and there are no significant differences between 
the muscles with the vagus nerve, the RLN can be 
considered as instrumental in effecting Fo changes 
(Table 5). 


We next investigated whether the role o? the PCA 
was also related to frequency (FO in hertz) and inten- 
sity (decibels), as compared to other muscles. We 
did find a trend toward decreased loudness for va- 
gus stimulation when PCA recruitment rates were 
greater than CT recruitment rates (p = .07). As we 
have observed, when the vagus nerve is used, the re- 
cruitment rate for the PCA tends to be greater than 
that for either the CT or the TA. Therefore, the re- 
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Pairwise p Values 








Muscle Mean Slope CT TA PCA 
Low frequencies 
CT 044 .770 .722 
TA 040 .770 937 
PCA 040 .722 937 
High frequencies 
CT 044 794 .007* 
TA 038 794 .003* 
PCA AML 007* .003* 


*Statistically significant, 
sulting lower intensity when the vagus nerve is used 
may be created by the recruitment rate of the PCA 
exceeding that of the other muscles (Table 6). Sound 
pressure level (normalized), intensity (decibels), and 
Fo (hertz) curves are depicted in Figs 6 and 7. 


Although isolated SLN stimulation was illustrated 
by lone vocal fold elongation, no changes in the sound 
envelope occurred. By contrast, both frequency (F0) 
and intensity (decibels) were markedly boosted with 
SLN stimulation over prior ILM saturation levels via 
either the RLN or the vagus nerve. The resultant 
sound pressure increases were so high that their maxi- 
ma were off the chart. 


DISCUSSION 


This research constitutes the first attempt to sepa- 
rately define the muscular interactions of the inde- 
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Fig 4. EMG CMAP tracings (in millivolts) of (top to bottom) CT, TA, and PCA for A) RLN and B) nerve X stimulation/ 
blocking envelopes. Stimulation via perineural electrode in bipolar mode (30 Hz, 2 mA for RLN; 20 Hz, 2.1 mA for nerve X) 
produces maximal amplitudes (saturation, left side of picture) in all 3 intrinsic laryngeal muscles. Superimposed anodal block 
(600 Hz, 2 mA for RLN; 600 Hz, 2.1 mA for nerve X) arrests neural conduction, resulting in flat EMG configuration. Progres- 


sive lifting of blocking charges (2 to 0 mA for RLN; 2 


.1 to O mA for nerve X) allows return of EMG CMAPS to their original 


saturation levels. Note that recruitment rates vary from muscle to muscle (differential rampings) and between stimulated 


nerves. Paper speed is 100 mm/s. Calibration is in millivolts. 
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Fig 5. EMG CMAP progressions (normalized values, abscissa) over time (milliseconds, ordinate) for 10 to 60 Hz. When RLN 
is stimulated (stimulation/blocking), TA ramping is highest up to 30 Hz, and CT predominates for 40 to 50 Hz. PCA is lowest 
for all frequencies except 60 Hz. Conversely, this muscle’s recruitment is highest with nerve X stimulation (stimulation/ 
blocking) under 20 to 60 Hz. CT and TA follow grossly identical progressions under those conditions. 


pendent anatomicophysiological units involved in 
voice production via artificially induced recruitment 
strategies. While this approach may not necessarily 
allow extrapolation of canine data to humans, there 
currently appears to be no clearer approach to secur- 
ing effective laryngeal control. 


Muscles are specialized biological machines that 
fulfill specific task-oriented functions dependent upon 
a number of characteristics such as vectorial anatomy, 
mass, stiffness, interactions with companion effec- 
tors, distinct motor neuron architectures, etc. Encase- 
ment within a tight fibrocartilaginous capsule makes 
ILM and related neurologic pathway contributions 
to voice production especially cumbersome. Laryn- 
geal mutually exclusive functions have traditionally 
been thought to be supported by the RLN, if SLN 
input to the CT (which is arguably of an extrinsic 
nature) is set aside, but both nerves carry afferent 


TABLE 6. RELATIONSHIP WITH INTENSITY AND 
FREQUENCY 








p Values 








Variable Changes in Intensity Average Fo 
Stimulating frequency Al 10 
Nerve (RLN versus X) 54 .02* 
PCA EMG results .96 65 

CT EMG results .03* a2 

TA EMG results 87 .83 


Fo — fundamental frequency. 
*Statistically significant. 





and efferent fibers to both nerves within the vagus. 
Fortunately, voice may be conveniently defined by 
its Fo and intensity (I). Such numerical interface for 
voice represents an opportunity compared to other 
laryngeal functions of earlier phylogenetic appear- 
ance (eg, swallowing) that may not be so conveni- 
ently defined. 


The current protocol involved RLN, vagus, and 
SLN stimulation, since the 3 nerves collectively chan- 
nel all afferent and efferent fibers to and from the 
larynx. Whereas the SLN is specific to the CT, ana- 
tomically distinct nerve bundles within the RLN are 
not sufficiently defined, until late division close to 
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Fig 6. Maximal sound pressure levels (normalized) and 
intensity (decibels) curves recorded during standard bi- 
polar stimulation (50 Hz, 2 mA, left side) give way, re- 
spectively, to flat line and drop (arrows) with 600 Hz 
anodal blockage (2 to 0 mA). Progressive lifting of block- 
ing currents allows controlled return toward original val- 
ues with time (abscissa, milliseconds), as seen on right 
side. Paper speed is 100 mm/s. 
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Fig 7. Fundamental frequencies (dots, hertz in ordinate) 
of sound envelope appearing in upper section for 30 Hz 
bipolar stimulation and 600 Hz anodal blockage of 2 to 
0 mA (see also Fig 6). Lower paper speed (25 mm/s) pre- 
cludes more explicit display of time (milliseconds, abscis- 
sa) on graph. 


or within the larynx,” to allow selective ILM stimu- 
lation. Also, the precise intralaryngeal neurofiber dis- 
tribution is not entirely known.?° Therefore, ILMs 
were targeted to power the variable glottic resistor 
via specific orderly recruitment according to the pio- 
neering contributions of Zhou et al?? and Baratta et 
al?3 in the feline hind-limb musculature. 


The significant differences in ramping values found 
in our earlier study with RLN stimulation!2 are con- 
firmed in the current study. Not surprisingly, data 
were equally significant when the vagus nerve was 
involved (Fig 5). Armed with these patterns of inde- 
pendent ILM behavior, we could proceed to the sec- 
ond stage of the current research, allowing ramping 
EMG CMAPS to be plotted against sound data (Fo 
and I) from the acoustic spectrum. Our setup did not 
allow recording EMG CMAPs and sound curves on 
the same graph, and starting times had to be manu- 
ally adjusted. However, potential discrepancies re- 
mained small and inconsequential because of the 
large number of data points, and potential flaws as- 
sociated with raw intersubject or experiment varia- 
tions were avoided, because EMG amplitude com- 
parisons involved series of ratios. 


A detailed discussion of the clinical aspects of hu- 
man voice production is clearly beyond the scope of 
this research. To avoid adding variables to an already 
complex analysis, we have deliberately ignored the 
importance of subglottic pressure variations in the 
genesis of sound, either experimental or voluntarily 
induced. Additional concerns about the true sum of 
multivectorial forces within individual ILMs27-30 
should also be eventually considered. 


Although nearly all ILM contractions, including 
those of the interarytenoid muscles,!!3! boost Fo, the 
increase is generally considered to be a result of 
unique cooperation between the TA and the CT.53!.32 
Conversely, whereas it has been debated whether a 
low Fo may be explained on the basis of an individ- 


ual decrease in CT contraction,*>*4 the TA could ei- 
ther raise or lower the frequency, depending on CT 
tension levels.° Active TA tension generates increases 
(as we showed; Fig 5) as long as CT contraction lev- 
els allow the amplitude of vocal fold vibration to be 
sufficiently large. Conversely, increased CT activity 
lowers the frequencies, because resulting tension in 
the vocal folds?’ outweighs the small gains the TA 
may achieve.*> The resulting voice modulation? may 
thus compare with interactions between the TA and 
other ILMs, such as the LCA, for swallowing,!! even 
though the outputs of this most effective adductor 
and the fine-tuned TA do not necessarily coincide 
nor follow similar timing schedules.°° 


There has been surprisingly little written on the 
PCA’s contribution to voice production, possibly be- 
cause of its indirect role. Although vocal fold abduc- 
tion from isolated stimulation of the PCA via its ter- 
minal branch not unexpectedly results in frequency 
and intensity drops,’ the pivotal function of this mus- 
cle is mutual balance with its antagonist, the TA. This 
balance helps stabilize the cricoarytenoid joint, while 
also opposing increased adduction from a strong CT 
pull, thus bracing the arytenoids and further stiffen- 
ing the vocal folds.3-57-13 


Our findings of high TA ramping rates from RLN 
stimulation (at least up to 30 Hz) come as no sur- 
prise, since the muscle is predominantly composed 
of small motor units with fast-contracting myofibers. 
Conversely, the PCA’s lower recruitment rates ob- 
served under similar experimental conditions are re- 
lated to prevailing slow motor units. Interestingly, 
these progressions are inverse when the vagus nerve 
is stimulated. On the basis of these observations, mu- 
tual reciprocal coupling could follow 2 modalities. 
First, an antagonistic RLN-mediated, TA-bound 
“strong” adduction balances against PCA-bound 
“weak” abduction; an alternate view is that the 
“strong” vagus-activated, PCA-bound abduction bal- 
ances against the TA’s relative “weakness.” This idea 
is further supported by the presence of larger contin- 
gents of adductory fibers in the RLN as compared to 
the vagus nerve, in which abductory pathways would 
predominate. Phylogeny further suggests, at least for 
voice, a more ancient role for vagus-mediated laryn- 
geal abduction (via the PCA, primed to counter air 
hunger), as compared to RLN-mediated (less vital) 
adduction via the TA. 


The Fo increases observed in this study depend on 
CT-TA coupling when the RLN is stimulated. Such 
a relationship is intriguing, considering that in con- 
trast to the TA, the arguably extrinsic CT is solely 
innervated by the SLN.?4 Whatever the responsible 
mechanism (eg, increase in subglottic pressure regard- 
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less of experimental conditions of constant flow!*), 
CT activation under RLN stimulation could be ex- 
plained by linkage within the endolarynx via the ansa 
Galeni, intramuscular terminal branching pat- 
terns,6.37,38 reflexogenic pathways originating in 
ILM proprioceptors, or an antidromic route to the 
vagus nerve. The nonsignificant trend in loudness 
decrease related to PCA contraction only when the 
vagus nerve (but not the RLN) was stimulated sug- 
gests the presence of abductory fibers within the va- 
gus nerve. Obviously, conduction studies would help 
further elucidate this matter. 


On the basis of our data, we submit that voice ma- 
nipulation may become possible in the future. Lev- 
els of loudness may be controlled by RLN stimula- 
tion to achieve CT ramping, coupled with reciprocal 
curbing of PCA activities through the vagus nerve. 
For Fo boosting, the TA may be instrumental, as long 
as the CT does not take over, but the moderator func- 
tions of the PCA still assume its recognized recipro- 
cal function. In all cases, however, direct SLN stimu- 


lation increases Fo and I across the board, assuming 
that prior tension is present in the other ILMs. 


CONCLUSION 


This research has correlated voice frequency and 
intensity in a canine model with recruitment patterns 
of key ILMs stimulated under conditions of subglot- 
tic insufflation. The ramping levels differed from mus- 
cle to muscle and also varied depending on whether 
the RLN or the vagus nerve was stimulated. The TA 
was found to be instrumental in Fo modeling at low 
intensity levels when balanced with the PCA under 
vagus nerve control. The CT enhances both Fo and I, 
ideally through TA-PCA coupling, a cooperation also 
observed for fine tuning at lower intensities. This 
study offers the first statistically significant analysis 
of laryngeal function in vivo under conditions of vari- 
able recruitment of motor units. The technique may 
further allow electronic manipulation of voice for neu- 
rologic conditions that fall short of complete laryn- 
geal paralysis. 
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THERAPEUTIC METHODS USED FOR OTOLARYNGOLOGICAL 
PROBLEMS DURING THE BYZANTINE PERIOD 
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Evidence of herbal, animal, and chemical substances from the natural world used in medicines for otolaryngological problems, 
including opium, hyoscyamus, barley, honey, dried beans and peas, olives, fruits, Agaricus, castoreum, cassia, and afronitron, was 
traced in the Byzantine medical treatises, mainly from the 4th century AD to the 15th century AD. The texts of Antyllus, Orivasios of 
Pergamos, Aetios of Amida, Alexander of Tralles, Paul Aeginitis, Leon Iatrosophistis, Theophanis Nonnos, Nickolaos Myrepsos, 
Michael Psellos, and others strongly suggest the influence of ancient Greek and Roman medicine, but at the same time stress original 
medical thought. The main otolaryngological problems encountered in that period were loss of hearing, purulent otitis, rupture of the 
tympanic membrane, pharyngitis, laryngitis, rhinitis, acute tonsillitis, seasickness, vertigo, fracture of the nose, and cancers of the 
ear, larynx, nose, and oral cavity. The tradition stating that remedies were the final products of substance combinations, started in the 
classical period (Sth and 4th centuries BC), is presented clearly and in detail in Byzantine prescriptions related to otolaryngology. 


KEY WORDS — Byzantine otolaryngology, cancer, hearing loss, laryngitis, otitis, pharyngitis, remedy, rhinitis, tonsillitis. 


INTRODUCTION 


During the Byzantine period (4th to 15th centu- 
ries AD), not only did the medical tradition of Hel- 
lenic and Roman antiquity survive, but there were 
also contributions to original medical thought and 
introduction of new practices and new terms, espe- 
cially in the early centuries (4th to 11th centuries AD). 
The new capital Byzantium founded by the Emperor 
Constantine and inaugurated in 330 AD as the “New 
Rome” — later Constantinople — soon played a cru- 
cial role in the eastern part of the Empire and became 
the true successor to Rome. The Greek language re- 
placed Latin as the official language of the Empire. 
Even though the Empire expanded deep into Asia, 
continuous attacks and wars with neighboring na- 
tions finally limited it. Starting in the 8th century AD, 
the Arabs, spreading the new religion of Islam, con- 
quered Egypt, other parts of northern Africa, and 
Syria; then, the Bulgars and Slavs occupied the Bal- 
kans. During the 11th century AD, the Turks further 
limited the Empire, which finally included only 
Greece and part of Asia Minor in post-Byzantine 
times. However, the Byzantine culture was a bril- 
liant one in almost every field of science. 


During the above-mentioned period, the practice 
of medicine was continuous and was performed by 
public and private doctors, and not by monks, as was 
widely believed until the beginning of the 20th cen- 
tury.! New therapeutic methods were introduced, and 
new medical terms were first used. The definitions 
and some symptoms of many diseases were men- 





tioned for the first time in medical history. The Byz- 
antine physicians transmitted medicine to the Ara- 
bic world. Nestorios, Patriarch of Constantinople in 
the 4th century AD, was also a physician. He was 
exiled by the 4th Universal Synod and with his com- 
panions went to Syria. His followers went to Djondi- 
shambur, where they established the basis of the Per- 
sian Medical School. They also cooperated with the 
neo-Platonians, a philosophical branch from Athens 
who had also been exiled by the Emperor Justinian 
I. During the 7th century AD, the Arabic Medical 
School was established, and later, through the Arabs 
of Spain, knowledge was transmitted to Montpellier 
and Salerno, where famous medical schools were es- 
tablished.” 


This study focuses on the substances the Byzan- 
tine physicians chose from the natural world to treat 
otolaryngological problems, foretelling the progress 
of modern otolaryngology. To uncover the knowl- 
edge of the Byzantine period, we examined the medi- 
cal texts of Antyllus (3rd century AD), Orivasios of 
Pergamos (4th century AD), Aetios of Amida (6th cen- 
tury AD), Alexander of Tralles (6th century AD), Paul 
Aeginitis (7th century AD), Stephanos of Athens (7th 
century AD), Palladios (6th century AD), Leon Iatro- 
sophistis (9th century AD), Theophanis Nonnos (10th 
century AD), and Nickolaos Myrepsos (13th century 
AD). In the 11th century, Michel Psellos undertook a 
large-scale review of the natural sciences. Symeon 
Sith, his secretary, commented on Galen’s philoso- 
phy. In this article, we quote the etymology of an- 
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cient Greek and Byzantine words and the original 
manuscript whenever appropriate to emphasize the 
original context, meaning, and use of the relevant 
words. For the ancient and Byzantine writers, our 
main sources were manuscripts and medical treatises 
from the National Library of Athens, the Cambridge 
University Library, and the CD-ROM edition of the 
Thesaurus Linguae Graecae. 


TYPES OF MEDICINES USED IN 
BYZANTINE PERIOD 


During the Roman period (2nd century BC to 4th 
century AD) and the Byzantine period (4th to 15th 
centuries AD), the use of herbal, animal, and chemi- 
cal substances as the main basis for remedies was 
shown in medical texts. From the Roman period, cen- 
trally acting preparations capable of producing un- 
consciousness and relief of pain were widely used, 
including cannabis, opium, and extracts of plants of 
the Solanaceae family, to which Mandragora, Hyos- 
cyamus, Helleborus, and nightshade (Solanum) be- 
long. In most cases, opium (mekon’s juice, or meko- 
nion in Greek), castoreum (another analgesic con- 
sisting of a secretion from the buttocks or the geni- 
tals of beavers), mandrake, and hyoscyamus were 
used against ear and eye inflammations, pain, ulcers 
in the ear, cough, and fever. Special remedies against 
the cough of tuberculosis or of rheumatism in the 
thorax consisting of opium, Aristolochia, hyoscya- 
mus, sulfur, and honey illustrate the medicine and 
pharmacopoeia of the time. Mekon or hyoscyamus 
leaves mixed with egg were given as plasters, while 
mekon juice (opium), mandrake juice, and even cas- 
toreum were mostly given as drops or as inhalations. 
Decoctions of various mixtures including opium were 
also given as drops. 


During the above period, opium was used in sev- 
eral ways, but mostly, like castoreum, in the form of 
poultices for ear inflammations and in order to re- 
lieve pain and rheumatism. In cases of mild pain, 
opium was used in the form of inhalations, whereas 
in cases of severe pain, mekon leaves mixed with a 
cooked or uncooked egg in the form of plasters were 
also suggested.’ In cases of severe pain, Galen? (2nd 
century AD) prescribed narcotic remedies consisting 
of opium. According to him, Archigenis, one of his 
peers, used against ear pain a decoction of mekones 
(white poppy flowers) mixed with warm oil or with 
woman’s milk or poppy seed powder, while Andro- 
machus (68 AD) used a mixture of myrrh, aloe, and 
mekon. For ear inflammations, Andromachus sug- 
gested halvani (galvanon), Cinnamomum (cinnamon), 
and mekon juice in the form of drops,‘ and for ulcer- 
ous ears, he mixed halvani and opium with honey in 
the form of drops. Another use of opium was in cases 


of ulcerous ears with blood and pus.* 


During the Byzantine period, these remedies con- 
tinued to be given in the form of powder, plasters 
(poultices), drops, pills, inhalations, and “katapotia.” 
The ancient word katapotion is derived from the verb 
katapeno (swallow) and means “anything of a solid 
nature in the shape of a small bullet that is taken by 
mouth.”> With the evolution of medicine together 
with pharmacopoeias, various preparations emerged, 
not only for surgical pain, but also for the pain of 
ulcers, ear inflammations, the cough of tuberculo- 
sis, fever, and other miscellaneous indications. The 
abundance of all relevant plants around the Mediter- 
ranean basin, from which they were exported, played 
a very important role in the development of medi- 
cine around the Mediterranean Sea, especially dur- 
ing the Ist and 2nd centuries AD and later. 


Besides the surgical and conservative therapies 
such as bloodletting, cleaning, special diet, and phys- 
iotherapy, the use of medicines for specific therapeu- 
tic indications began. These medicinal preparations 
were of plant, mineral, metallic, and chemical ori- 
gin, and most of them included opium. They were 
given either as antidoton (antidotes) or for therapeu- 
tic purposes. Antidotes, consisting of chemical, ani- 
mal, or herbal substances, were administered orally, 
rectally, or locally as embrocations (a liquid used to 
moisten or rub a bruised or diseased part of the body), 
poultices, or ointments. Medicines were prepared in 
the form of drosaton (“mixture of pharmaceutical 
liquid substances given orally”), polymaton (mixture 
similar to “drosaton’’), zulapium or serapium (“ex- 


` tract by decocting [boiling] of medicinal herbs”), or 


as sublingual tablets, poultices, or eyewashes. 


The soothing properties of many substances from 
plants, such as the leaves of the olive tree, selinon, 
Helleliphascon, and others, were also used to ease 
removal of foreign bodies from the external auditory 
canal, and a solution of saltpeter and vinegar was 
used for softening of hardened cerumen. All of these 
suggested substances continued the medical tradition 
of Hellenic antiquity. In otolaryngological cases, es- 
sential or volatile oils were not used, whereas oils 
from percolation (mainly olive oil) were highly re- 
spected by both folk and scientific medicine. If one 
dropped olive oil in the auditory canal, the foreign 
body could more easily slip out. 


The main topics of concern of Byzantine medical 
thought in the field of otolaryngology were loss of 
hearing (kofossis, varykoia, or dysecoia in Greek), 
purulent otitis, rupture of the tympanic membrane, 
seasickness, vertigo, acute pharyngitis, laryngitis, rhi- 
nitis, tonsillitis (amygdalitis in Greek), tonsillectomy, 
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cancer of the ear, larynx, nose, or oral cavity, and 
fracture of the nose.” The terminology, stemming from 
Greek words, is still in use today. 


THERAPEUTIC TREATMENTS FOR 
OTOLARYNGOLOGICAL PROBLEMS IN 
BYZANTINE PERIOD 


For hearing loss physiotherapy, cathartic plasters 
and drops formed from chemical, animal, and herbal 
substances were suggested. Sith’s?.® suggestion, a 
mixture with crane’s bile useful for the loss of hear- 
ing, completed the advice of Aetios,’.? which was to 
keep “the holy place clean and the head dry and to 
drop in the ear medicines from nut oil.” According 
to Alexander of Tralles,!© drops of myrrh, incense, 
bitter nuts, afronitron, which is a white foam of ni- 
ter, crocus, opium, and the plant “galvanon” mixed 
with an acid had positive results. Nickolaos Myrep- 
sos!! suggested mixing hare’s bile and woman’s milk 
and, after warming the mixture, dropping it in the 
ear; the patient would be astonished by the result. 
Moreover, he applied a special plaster from an as- 
phodel with oil and suggested rice juice or seeds of 
wild fig trees. 


For acute purulent otitis, for which Sith78 blamed 
cold that consequently caused pain and inflamma- 
tion in the ear, Sith suggested wild mallow and espe- 
cially a paste of small lentils mixed with balsam and 
a paste of pumpkin flowers as a plaster. In the case 
of tympanic membrane rupture, Alexander of Tral- 
les!° suggested dropping substances from nature into 
the ear. In this case, medical treatment and cleaning 
the ear with special instruments was used. Even 
though Aetios? gave similar advice for otitis, he 
seemed to prefer surgical treatment. 


In cases of acute tonsillitis and pharyngitis, Theo- 
phanis Nonnos and Paul Aeginitis’-!2 used washing 
with antiseptic substances, as well as gargles and 
coatings. Alexander of Tralles!9 suggested anti-in- 
flammatory treatment with manure from dogs. He 
also used a mixture of chemical and vegetable sub- 
stances: blueberries and honey with myrrh, nitre (salt- 
peter), and sulfur. Moreover, elxini (a burr) was used 
for tonsillar inflammation. According to Sith,® “zou- 
lapis,” a decoction of pharmaceutical substances, 
named serapium or zulapium by Actuarius, could be 
used during bloodletting, chiefly from the vein un- 
der the tongue. 


Two lists of Greek surgical instruments found from 
the Byzantine period include an antiotomon (tonsil 
knife), strongly indicating that the Byzantine doc- 
tors knew the technique of removing the tonsils, as 
was also mentioned in their scientific texts. 13 


According to Aetios’ method, if a tonsillar abscess 


was presented, it was opened so the pus could exit. 
Then the physician applied a gargle of “melikraton,” 
a mixture of honey and milk or roses. Afterward, he 
suggested gargling with milk several times. If the 
abscess recurred, and medicines seemed not to work, 
he performed tonsillectomy. Aetios, in his descrip- 
tion of the surgical approach, mentioned that the part 
of each tonsil that protruded abnormally from the 
anterior faucial pillar to the middle line was cut off. 
After the operation, he recommended the prescrip- 
tion of cold “oxicraton,” that is, a drink of milk and 
vinegar.’ Paul Aeginitis also described the removal 
of the tonsils by hook and scalpel, and Leon Iatro- 
sophistis (a title for head or teaching physicians) also 
applied tonsillar cutting.!4 They all stressed the dan- 
ger of bleeding. 


For acute laryngitis, Myrepsos’:!! applied sublin- 
gual disks, antidote, and gargles. The suggested anti- 
dote consisted of nuts, sweet rice paste, pine nuts, 
and poppy seed, while the disks consisted of incense, 
pepper, crocus, sweet rice paste, and soft pine nuts. 
Sith’ underlined the positive effects of mallow, which 
made the voice smoother, while Ioannis Actuarius?:!5 
suggested gargles with “moros” for laryngeal infec- 
tions, and Andriomenos!® added the use of plasters 
to improve the voice. 


For vertigo and seasickness, treatment consisted 
of diet, physiotherapy, cleaning, and remedial sub- 
stances. The last came from chemical, animal, and 
other substances, given by mouth in drops, antidote 
paste, and decoction forms. Dioscorides (1st century 
AD) used nut oil, and Aetios used drops, pharmaceu- 
tical resin, and walnut oil. Ioannis Actuarius!> under- 
lined the use of chemical substances that alleviated 
bad mood and applied a special medicine, as well as 
“the theriac antidote.” The latter was a special mix- 
ture of pepper, a poppy decoction, dried roses, iris, 
cinnamon, sweet rice paste, incense, and Athenian 
honey. “Iera Rufus,” a special pharmaceutical compo- 
sition of chemical, animal, and vegetable substances, 
was suggested by Rufus’!7 and Aetios, whereas Ori- 
vasios!® stressed the positive results of Agaricus (a 
fungus genus) and Koriandros’ seed or lettuce seed, 
as well as apple paste, barley, and grapes. Symeon 
Sith’ suggested tree fruits for seasickness and appli- 
cation of water with a sponge, mentioning also how 
warm food and medicines could help. Against tinni- 
tus, Alexander of Tralles!° suggested irrigating the 
ear with acid, honey, and nitre. 


Although Hippocrates (4th century BC) was the 
first to describe the cause of cancer as an excess of 
“black bile,” this view was elaborated by Galen (2nd 
century AD) and dominated Western medical thought 
for more than a thousand years.!9 Byzantine physi- 
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cians believed in a humoral predisposition. Nickolaos 
Myrepsos stressed that tumors appeared to be caused 
by the bad condition of the organism or to bad juice 
in the entrails. Aetios and Theophanis Nonnos spoke 
of “carcinoid dispositions.” According to Aetios, can- 
cer was one of the “melancholic passions.” Paul Aegi- 
nitis, Theophanis Nonnos, and Neophytos agreed with 
him. The last said that cancer was caused by a “boil- 
ing melange of bile.” Theophanis Nonnos and Steph- 
anos from Athens?,29 added that when black bile was 
excessive in the organism, karkinoi (cancers) and on- 
goi (tumors) were caused.!9 There was also a belief 
that external toxic substances could cause cancer, as 
shown by the therapeutic treatment by antidotes. Dif- 
ferent types of cancer were known, apart from child- 
hood cancer, which was not fully recognized.”! 


For cancer of the oral cavity, which was distin- 
guished as cancer of the tongue and the lips, Myrep- 
sos applied a medicine called xirion, which was a 
mixture of the roots of nut trees, seashells, burned 
dates, and pepper plant. For cancer of the ear, Myrep- 
sos7-11 applied the “plaster of Apollonius Megaritus,” 
which, according to him, was beneficial for sarcoma 
and consisted of a burned metallic mixture, terra sigil- 
lata, and walnut oil. For cancer of the nose, Myrepsos 
applied a mixture of eggshells, burned nut flakes, and 
burned palm tree bark. Aetios”.? suggested another 
well-known medicine of his time, “drakontion,” with- 
out giving details about it. However, from other 
sources, it becomes clear that it was an herb for snake- 
bite. Others, like Apuleius, used the term “clematitis” 
for nasal carcinoma and polyps, usually to describe 
a benign tumor appearing in the nasal cavities with a 
pedicle and originating from the mucous membrane. 
Also, malignant and “carcinoid” ulcers were men- 
tioned as appearing in the larynx. Therapeutically, 
chemical, animal, and herbal substances were applied, 
such as ammonium, scammony, castoreum, myrrh, 
Agaricus, Helleborus niger, bdellium (or kommi in 
Greek), cassia, and others. 


Byzantine physicians suggested plugging the nose 
with gauze or spraying pharmaceutical substances 
into the nose for nosebleeds. Alexander!’ suggested 
use of a specially made burned sponge soaked with 
oil; Aetios’ suggestion was similar.? Alexander ap- 
plied plane tree root to stop the bleeding. A paste of 
“hypouris,” a plant living near the water, was sug- 
gested by Aetios for its similar action. Sith® recom- 
mended cinnamon for bleeding. 


Aetios”? used the plant “drakontion” against na- 
sal polyps, and Myrepsos!! suggested a medicine con- 
sisting of styptiria, myrrh, a chemical substance of 
sulfur, and burned copper. Among the two lists of 
Greek surgical instruments mentioned aboye, there 


were special models for excision of nasal polyps.!3 
Texts of the period do not give any further details, 
apart from the way Byzantine physicians managed. 
the odor caused by nasal polyps: they usually dropped 
inside the nose a pharmaceutical powder consisting 
of kissos (ivy), myrrh, dried roses, and other natural 
substances.’ For dyspnea, “xirion of Theodore,” con- 
sisting of grapes, myrrh, and dried roses, was ap- 
plied. 


CONCLUSION 


During the Byzantine period, the main otolaryngo- 
logical problems were not much different from to- 
day’s problems. The main topics of concern were loss 
of hearing, purulent otitis, rupture of the tympanic . 
membrane, seasickness, vertigo, acute rhinitis, phar- 
yngitis, tonsillitis, laryngitis, fracture of the nose, and 
cancers of the ear, nose, oral cavity, and larynx. 


In many cases, chemical, herbal, and animal sub- 
stances were used in the form of drops for therapeu- 
tic treatment. Particularly against inflammations, 
similar substances were used in the form of plasters, 
disks, and antidotes or gargles. Decoctions from plants 
were highly respected, and burned metallic mixtures 
were often used against cancer. Surgical treatment 
was not unknown, and tonsillectomy was used. 


It is important to mention that from Galen’s time 
until the Byzantine period, most substances were used 
together with others in the form of remedies against 
diseases or in other medicinal situations, and not 
alone, as they had been mainly used in early antiq- 
uity. Then, physicians started to distinguish the indi- 
vidual effects of the different substances in the reme- ` 
dies, isolate the therapeutic substances, and acknowl- 
edge their specific actions on the human body. The’ 
more scientific that medicine and therapeutic treat- 
ment became, the more physicians found it desirable 
to rely upon the expertise of a specialist in the prepa- 
ration of remedies. The observations that followed 
from experience acquired the form of a real materia 
medica, consisting of a system of remedies and their 
ingredients. Dioscorides later contributed much to this 
objective. 


Byzantine physicians conserved the original find- 
ings of ancient Greek medicine and improved on 
them. Also, the ancient Greek terminology they fol- 
lowed and further completed was adopted by mod- 
ern medicine. It is obvious that most of the remedies 
they used are nowadays out of use. However, some 
of the substances, mainly herbal ones such as opium, 
are in use even now in the field of otolaryngology 
and constitute contemporary therapy against pain and 
many diseases. 
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` INTRODUCTION 


The Klippel-Trenaunay syndrome (KTS) is a rare 
congenital disorder characterized by the triad of cu- 
taneous capillary hemangioma, soft tissue or bony 
hypertrophy, and varicose veins or venous malforma- 
tions. It can be diagnosed on the basis of any 2 of 
these 3 features.! Other associated manifestations in- 
clude lymphangiomatous and venous anomalies,? 
arteriovenous microfistula,!5-5 visceral and facial he- 
mangiomas, Kasabach-Merritt syndrome,’ dissemi- 
nated intravascular coagulopathy, lipodystrophy, and 
mental deficiency. Vascular malformations associ- 
ated with KTS generally involve deep veins in the 
extremities (mostly the lower extremities)? and some- 
times present as hypoplasia, aplasia, valvular incom- 
petence, or aneurysmal dilation.!° For the most part, 
the lesions are unilateral.!! Lesions in the head and 
neck are uncommon.? In patients with head and neck 
involvement, bleeding and pain resulting from cuta- 
neous vascular nevi, varicose veins, or venous mal- 
formations are the most common symptoms, but neu- 
ral deficiencies are uncommon. In particular, intra- 
cranial lesions in KTS are extremely rare.!2-!8 To our 
knowledge, macrovascular malformation at the cere- 
bellopontine angle in KTS has not been reported, and 
we present here the first case of remarkable venous 
malformation in the cerebellopontine angle in KTS, 
which caused ipsilateral sensorineural hearing loss 
(SNHL). 


CASE REPORT 


A 7-year-old boy with KTS visited our outpatient 
clinic with the chief complaint of hearing loss in his 
left ear, discovered by an annual school screening. 
On examination, the triad of KTS was noted: cutane- 
ous capillary hemangiomas in the left leg and the 
left abdomen, hypertrophy of the left leg and pelvis, 
and varicose veins in the left leg accompanied by 


pain. The patient denied tinnitus, vertigo, or any other 
otologic symptoms. He had no family history of idio- 
pathic or hereditary hearing loss. On examination of 
the head and neck, superficial hemangiomas were 
also found in the left temporal and zygomatic regions, 
as well as in the left part of the pharynx. Both ear- 
drums were normal. A pure tone audiogram revealed 
an SNHL (pure tone average, 65 dB hearing level) 
in the left ear, and the hearing was normal in the right 
ear (Fig 1). The speech discrimination scores were 
100% and 65% in the right and left ears, respectively. 
Both ears showed type A tympanograms. Despite the 
hearing loss in the left ear, the stapedial reflex was 
normally evoked by left-sided stimulation with in- 
tense sound stimuli (>90 dB hearing level); this find- 
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Fig 1. Pure tone audiogram revealed horizontal-type sen- 
sorineural hearing loss in left ear. O — right air conduc- 
tion thresholds; X — left air conduction thresholds; ] — 
left bone conduction thresholds. Pure tone averages were 
10 dB hearing level (dBHL) and 65 dBHL in right and 
left ears, respectively. 
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Fig 2. Distortion product otoacoustic emissions showed A) good response in right ear and B) no measurable 


response above noise levels in left ear. 


ing indicated a recruitment phenomenon, which is 
usually observed with inner ear damage. Waves I, 
III, and V were clearly recorded by auditory evoked 
brain stem responses in both ears, with slightly longer 
latencies in the left ear (wave V latency, 5.92 ms on 
the left and 5.44 ms on the right at 75 dB normal 
hearing level). However, this latency difference on 
the auditory brain stem evoked response was not con- 
sidered significant, because of the large difference 
in auditory threshold between the right and left ears. 
Distortion product otoacoustic emissions were re- 
corded with the ILO92 otodynamics system (f2/f1 = 
1.2), which showed impairment of the left cochlear 
function (Fig 2). Caloric responses were normal in 
both ears. No facial palsy or any other neurologic 
disorders were noted. The ossicles and the cochleas 
appeared normal on high-resolution computed to- 
mography of the temporal bone. Imaging studies of 
the brain, including computed tomography and mag- 
netic resonance imaging, revealed an intracranial ve- 
nous malformation (Figs 3-5). The left jugular fora- 
men and the left jugular bulb were anomalously large 
(Fig 3). A series of computed tomographic scans re- 


Fig 3. Computed tomograms. A) 
Coronal scan shows dilatation of 
left jugular foramen at skull base. 
B) Axial scan shows anomalously 
large left jugular bulb that extends 
to medial bony wall of tympanic 
cavity. 


vealed anomalous dilated veins in both cerebellar 
hemispheres arising from the left jugular vein (Fig 
4). The diameter of the anomalous vein became grad- 
ually smaller with branching in the periphery, as in 
normal vessels, without any masslike enhancing 
structure. Since these images were not compatible 
with those of a hemangioma, which usually presents 
as an encapsulated enhancing mass with a feeding 
artery and a draining vein, the lesion was diagnosed 
as a venous malformation. The trunk of the anoma- 
lous vein occupied the left cerebellopontine angle 
(Fig 5), presumably influencing the eighth cranial 
nerve. 


DISCUSSION 


The clinical characteristics of 252 patients with 
KTS have been reported.’ Of these patients, 63% had 
all 3 features of KTS, and 37% had 2 of the 3 features. 
Capillary malformation was the most common fea- 
ture (98%), followed by soft tissue or bone hyper- 
trophy (94%) and varicosity (72%). Pain was the most 
common symptom other than the triad.” Our patient’s 
symptoms corresponded with the triad. The findings 
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that all the malformations, including those in the head 
and neck and in the cranium, were left-sided, the pain 
in the lower extremity was on the left, and the hearing 
loss was on the left correspond to the unilaterality of 
KTS." 


Intracranial vascular anomalies are rare in patients 
with KTS.!2-!8 Goldstein et al!? reported aplasia of 
the cervical internal carotid artery and malformation 
of the circle of Willis in a patient who presented with 
headache and photophobia. Intracranial aneurysms 
have been reported in 4 cases. 13-16 Spallone and Tche- 





Fig 4. Series of axial computed 
tomography scans with contrast 
enhancement reveals venous mal- 
formation of bilateral cerebellar 
hemispheres, arising from left 
jugular vein. Sections are arranged 
from lowermost section (A) to 
uppermost section (F). A) Trunk of 
abnormal vein originates from left 
jugular vein. B) Note large differ- 
ence in size of sigmoid sinuses. 
Left sigmoid sinus is much wider 
than right. C) Branching of anom- 
alous vein. D) One of 2 main 
branches crosses brain stem to oth- 
er (right) side. E) Each main branch 
drains each hemisphere of cerebel- 
lum. F) Peripheral branching pat- 
tern of this anomalous vein is sim- 
ilar to that of normal veins. 


rekayev!+ reported decreased visual acuity due to an 
aneurysm. Intracranial hemangiomas have been re- 
ported in 3 cases.!®!8 The symptoms depended on 
the size and localization of the aneurysm or hemangi- 
oma. However, SNHL caused by intracranial vascular 
anomalies in KTS has not been reported. 
Macrovascular SNHL, in which vessels proximal 
to the labyrinthine artery or vessels that can be visual- 
ized without the use of a microscope are involved, is 
not uncommon.!? Macrovascular SNHL can be 
caused by aneurysms, vertebrobasilar artery dolicho- 


Ishimoto et al, Imaging Case Study of the Month 561 





Fig 5. T2-weighted axial magnetic resonance image re- 
veals trunk of venous malformation occupying left cere- 
bellopontine angle. 


ectasis, arteriovenous malformations, and vascular 
loops.!? Barr et al? reported clinical and magnetic 
resonance imaging features that suggested telangiec- 
tatic vascular malformation of the pons. These pa- 
tients exhibited a variety of symptoms, including head- 


ache, vertigo, gait abnormality, and hearing loss. Mur- 
phy!® reported that the hearing loss might improve 
after the macrovascular lesion is treated. In our case, 
surgical intervention has not been planned, because 
the malformation was not considered to endanger the 
patient’s life. 


Distortion product otoacoustic emissions are a 
promising physiological tool for detecting subtle path- 
ological conditions within the cochlea.*! In our pa- 
tient, retrocochlear impairment could not be excluded, 
because of the slight elongation of wave latencies in 
the auditory brain stem response and the moderate 
deficiency in the speech discrimination score. How- 
ever, absent responses in distortion product otoacous- 
tic emissions and the recruitment phenomenon sug- 
gested that the principal cause of SNHL existed in 
the cochlea. In fact, in a study of acoustic neuromas, 
which also form space-occupying lesions at the cere- 
bellopontine angle, the cause of SNHL was attributed 
to cochlear damage in more than half of the cases.?? 
In previously reported cases of macrovascular SNHL, 
the cause could be attributed to cochlear impairment 
in some patients!?? and to retrocochlear damage in 
others. !° 
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LETTERS TO THE EDITOR 


To the Editor: 


The article by Damrose et al (Lack of evoked laryngeal elec- 
tromyography response in patients with a clinical diagnosis of 
vocal cord paralysis. Ann Otol Rhinol Laryngol 2001;110:815- 
9) that discusses electromyography (EMG) responses in the sur- 
gery of recurrent laryngeal nerve (RLN) paralysis warrants sev- 
eral comments and recommendations. Because it is often diffi- 
cult to arrange conventional electrodes during thyroidectomy, 
thyroplasty, and esophagectomy, the resulting data consequently 
may be of limited value. Because of these technical failings, a 
variety of indwelling electrode-bearing endotracheal tubes were 
developed.!-!! Unfortunately, the authors inadequately describe 
their device, and the interpretation of their data is contingent 
upon this information. As shown in our Fig 1, the electrodes may 
not contact the vocal fold (VF), and the resulting signals can rep- 
resent activity from the contralateral VF or, potentially, bilateral 
activity. In addition, the false-positive EMG responses reported 
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Fig 1. Inconsistent locations of electrodes cause poten- 
tial errors in results of electromyographic recording. In 
cross section of glottis (A, upper), symbols are used in 
electrode: black circle for active surface electrode and 
open circle for reference electrode. Al — rotation of 
electrode to counterclockwise position; Ao — proper 
location of electrode; Ar — clockwise rotation of elec- 
trode. In coronal view of larynx with intubation tube in 
situ (B, lower), Bs — did not contact vocal folds; Bo — 
adjacent to vocal folds; Bd — inferior insertion, below 
vocal folds. Respective results should be recorded from 
locations at Ao and Bo of electrodes. 


Bs 


a 





in this article may in fact be merely stimulating artifact without 
a compound action potential (CAP). 


We suggest that the authors have not established whether the 
recorded responses originate in the RLN or in the superior laryn- 
geal nerve (SLN). In their Figure 2C, employing the authors’ 
term “normal evoked response,” the onset of the electrical re- 
sponse should be negative (as in their Figure 2A). However, we 
interpret this configuration as an M wave, ie, a direct muscle 
response. It is important to note that there is no evidence of a 
direct connection between the SLN and the thyroarytenoid mus- 
cle, nor did the authors discuss such a possibility. It is possible 
that an overimposition of signals comes from both the cricothy- 
roid and thyroarytenoid muscles. The wavelets modified may 
lead to an erroneous interpretation. To rule out this possibility, 
the wavelets could be compared by simultaneous surface and 
concentric electrodes in both the cricothyroid and thyroarytenoid 
muscles. 
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Fig 2. Responses to RLN stimulation: direct response, R1, and 
R2 from thyroarytenoid muscles in humans with hooked-wire 
electrodes. S — stimulation. 


The authors failed to adequately describe the consequence of 
vagal stimulation and its impact upon SLN and RLN stimula- 
tion. The appropriate location of vagal stimulation, as we under- 
stand it, is at a point in the main trunk prior to or central to the 
laryngeal branches. An absence of CAP following vagal excita- 
tion implies that the axons of both RLN and SLN motoneurons 
have not regenerated. 


We are aware of the diagnostic limitations of videostrobola- 
ryngoscopic (VSL) imaging and its role in the diagnosis of neu- 
ral paralysis or paresis and cricoarytenoid joint dysfunction. In 
our opinion, a comprehensive electrophysiological evaluation 
using the accepted criteria is essential for confirmation of the 
visual impression given by VSL imaging. 

The response to SLN stimulation has been reported and is of 
considerable clinical value.!2-!5 Although RLN stimulation has 
been used in the laboratory,!?:!®!7 the application for diagnosis 
of laryngeal paralysis remains uncertain. There are 3 types of 
response to RLN stimulation in humans. !%!8 Initially using a 1- 
ms pulse, stimulation levels are increased until an early direct M 
response around 3 ms is elicited in the thyroarytenoid muscle, 
implying RLN stimulation. Furthermore, a consistent M re- 
sponse can be elicited between | and 2 mA. Finally, the exciting 
levels were then gradually increased until the supramaximal 
ranges were reached. The results were interpreted as an ipsilat- 
eral response (R1) and bilateral responses (R2; Fig 2). If the re- 
cording electrodes and stimulation electrodes are in the correct 
location with a large window, then 3 wavelets can be recorded: 
the R1 and R2 reflex responses and the M wave. The clues for 
differentiation between peripheral and central laryngeal paraly- 
ses, therefore, can be obtained by multiple bilateral laryngeal 
muscle EMG recordings and laryngeal nerve stimulation tech- 
niques. 

We know from experience that it is possible to evaluate im- 
mobile VFs accurately by using EMG in the earliest period of 6 
to 8 months following the onset of damage. A shorter interval for 
surgical interference is not recommended. Though anecdotal, 
we have examined 3 cases of VF paralysis following carotid 
endarterectomy by EMG (unpublished data). In these individu- 





Fig 3. Bilateral recordings from thyroarytenoid (TA) and crico- 
thyroid (CT) muscles in patient who underwent carotid endar- 
terectomy. Right recording is obtained from TA muscle during 
phonation of /i:/, and left from CT muscle during pitch change. 
Top recordings were obtained 7 days after surgery: right TA 
muscle showed electrical silence, and left TA muscle showed 
positive sharp wave. Bottom recordings were collected 3 months 
later: right TA muscle showed reduced recruitment potentials, 
and left TA muscle showed normal recruitment potentials. 


als, a change from a neuropathic pattern to a normal pattern oc- 
curred within 3 months (Fig 3). Consequently, our protocol, in- 
sofar as surgical exploration is concerned, has established a 6- 
month grace period to allow for regenerative recovery (refer to 
patient 15 in the Damrose et al report). 


From aclinical standpoint, we recognize that synkinesis iden- 
tifies a degree of natural recovery. We propose that when this 
situation is both clinically evident and confirmed by EMG, be- 
fore nerve grafts or anastomosis, other options, such as thyroplasty 
or botulinum toxin injection, be allowed initial consideration. 


Finally, many problems in the neuromonitoring of the larynx 
remain in our attempt to design a reliable, noninvasive, and 
convenient device, as well as in the appropriate use of RLN 
stimulation techniques to prevent RLN injury. We look forward 
to the authors’ responses. 


SHENGGUANG YIN, MD* 

Lours W. WELSH, MD 

FRED J. STUCKER, MD* 
*Dept of Otolaryngology— 

Head and Neck Surgery 
Louisiana State University Health 
Science Center in Shreveport 

1501 Kings Highway 
Shreveport, LA 71130-3359 
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To the Editor: 


Tread with interest the article by dos Santos et al (Involvement 
of level I neck lymph nodes in advanced squamous carcinoma of 
the larynx. Ann Otol Rhinol Laryngol 2001;110:982-4). I have 
three criticisms regarding this article. 


Involvement of levels I and V by metastatic disease is quite 


` rare in squamous carcinoma of the larynx. While level J metas- 


tases are reported for supraglottic carcinomas, level V metas- 
tases are associated with involvement of multiple levels. In this 
article, there is no information on the tumor localization of the 
2 patients who had metastases in level I. 


Also, I object to the statement of the authors that “there is no 
impact of level I involvement on overall survival” (p 984) when 
there were only 2 patients with metastases of this level. 


My third criticism is about the writing style. The introduction 
and discussion parts of the text are nearly the same as those of 
another paper, published in 1991.! In my opinion, in writing an 


13. Thumfart WF. Electrodiagnosis of laryngeal nerve disorders. En- 
technology 1988;67:380-4, 386-8, 390-3. 
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Laryngol 1992;101:127-34. 
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article, it is not appropriate to use statements or paragraphs from 
another study. In particular, the discussion section should be 
original and reflect the authors’ own ideas, explanations, com- 
ments, or experience. 


Although the authors have addressed the relevant reference in 
their reference list, their use of sentences and paragraphs exactly 
the same as those in a preceding article may be regarded as ob- 
jectionable. 

TANER K. ERDAG, MD 
Koruturk Mah. Mese Sok. 
Erarslan Apt No. 17/5 
Balcova 35330 

Izmir, Turkey 
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BOOK REVIEWS 


Otitis Media in Infants and Children 


Charles D. Bluestone and Jerome O. Klein. Third edition. 
Hard cover, illustrated, indexed, 404 pages, 2001. Saunders, 
Philadelphia, Pa, $85. l 


Purpose: “To provide a critical and contemporary review of 
the important and latest medical information that is of interest to 
all health care workers committed to the well being of infants 
and children.” The material in this edition is based on investiga- 
tion by the authors and a review of the literature as of January 1, 
2000. 


Contents: This book can be divided intọ 3 parts: 1) basics 
(definitions, anatomy, physiology); 2) causation (epidemiol- 
ogy, microbiology, immunology); and 3) diagnosis, treatment, 
and complications. The first edition was published in 1988, the 
second in 1995. Updating in this edition includes new classifi- 
cation and staging systems for otitis media and its complica- 
tions. A classification includes eustachian tube dysfunction. 
There is an up-to-date portion on the method of clinical and 
microbiological diagnosis of otitis media. New information in- 
cludes antibiotic resistance of otitic bacterial pathogens and the 
‘latest available information regarding viral and bacterial vac- 
cines for prevention. Most important, it includes reprinting of 
some of the classic studies of the previous books. 


Attributes: The following is quoted from Adam Politzer’s last 
edition of the Lehrbuch, published in 1926: “Diseases of the 
middle ear originate in the mucus membrane lining; from here 
those inflammatory changes start which not only often disturb 
the hearing, but not infrequently extend to neighboring vital 
organs, and endanger the life of the patient. The knowledge of 
these changes is therefore of great importance as they form the 
basis for the diagnosis and treatment of middle ear infections.” 
(Politzer A. Textbook of the diseases of the ear. 6th ed. Philadel- 
phia, Pa: Lea and Febiger, 1926:270). 


In the current volume, the authors have elaborated extensive- 
ly on those objectives pointed out by Politzer approximately 75 
or more years earlier. They have attempted to evaluate and dis- 
cuss the various aspects of serous otitis media in a concise man- 
ner directly applicable to expeditious patient care. 


At the time of this publication, Tauno.Palva has published an 
essay whose thrust is to aerate or alleviate disease in the epitym- 
panum in cases in which multiple tubes have been needed. Per- 
haps this essay of Palva should be reviewed by Bluestone and 
Klein to assess otitis media complications before a fourth edi- 
tion. 


Applicability: The book is of use to all physicians involved in 
the treatment of otitis media in infants and children, especially 
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otolaryngologists, pediatricians, and family practitioners. It is 
another step toward better understanding and more accurate and 
effective treatment of the various aspects of otitis media. 


DENNIS G. PAPPAS, MD 
Birmingham, Alabama 


Decision Making in Ear, Nose, and Throat Disorders 


Cuneyt M. Alper, Eugene N. Myers, and David E. Eibling. 
Hard cover, indexed, 385 pages, 2001. Saunders, Philadel- 
phia, Pa, $95. 


Purpose: To provide basic clinical information, in an algo- 
rithm format, that is relevant to otolaryngologists in making a 
diagnosis. 


Contents: In addition to the anatomic sections, several other 
topics are covered that are important to the overall review of 
otolaryngology, including neck, plastics, and general subjects 
(bleeding disorders, anesthesia, and, to be exact, 11 more such 
general subjects). 


Attributes and Limitations: Each lesson (topic such as dizzi- 
ness) is structured so that it provides an explanation of a portion 
of the algorithm (designated by alphabetical letters that match in 
the algorithm and the page of explanation). Typically, the expla- 
nation part consists of short paragraphs that offer concise, high- 
quality, appealing features of the decision-making examina- 
tion. The text part (explanation) is an overview consisting of 
basic information on the specific subject. A problem that emerges 
from such a model, which necessitates brevity of information, 
is that important features of information are omitted. For ex- 
ample, it would be difficult to diagnose “psychiatric dizziness,” 
as suggested (p 22) on the basis of “psychiatric symptom cluster 
(e.g., dizziness during a panic attack).” There are a complex of 
symptoms that make up this syndrome. 


Each section (topic) is self-contained on 2 pages that face 
each other, except for a number of the algorithms that refer to 
other chapters. The book has a touch of traditional features in the 
text and a contemporary feature in the format. It is unique. The 
authors met the goal of the algorithm format: maximizing the 
amount of information within a minimal amount of text. 


Applicability: One of the authors felt that the information in 
this book would be useful to-medical students, residents, and 
junior practitioners in our specialty. I think, also, that the vol- 
ume offers a review resource to the more seasoned practitioner. 


DENNIS G. PAPPAS, MD 
Birmingham, Alabama 
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Design Features (US patent 5,893,830; international patents pending) 
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e The UM glottiscope is supplied with a variety of examining tubes which accommodate to the spectrum of 
human size and shape, irrespective of gender, age, or disease, and that attach to a single universal handle. 

e The universal, ergodynamically designed titanium handle can be joined with a suspension gallows as well as 
American and European chest-support holders. 

e The detachable base-plate is ideally suited for difficult intubations; lighting and suction cannulae are contained 
within the base-plate. 
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DENDRITIC CELL INFLUX DIFFERS BETWEEN THE SUBGLOTTIC 
AND GLOTTIC MUCOSAE DURING ACUTE LARYNGOTRACHEITIS 
INDUCED BY A BROAD SPECTRUM OF STIMULI 


PETER JECKER, MD 
MAINZ, GERMANY 
ANDREW S. MCWILLIAM, PHD 
PERTH, AUSTRALIA à 


WOLF J. MANN, MD, PHD 
f MAINZ, GERMANY 
PATRICK G. HOLT, PHD 
PERTH, AUSTRALIA 


Clinically, the subglottic and glottic mucosae may react differently, eg, during acute laryngotracheitis. In healthy rats, we showed 
previously that the composition of the mucosal immune system of the larynx also differs between these areas. Neutrophils, lympho- 
cytes, and dendritic cells (DCs) are part of this mucosal immune system. In particular, DCs occupy a key function. They migrate into 
inflamed mucosae during the early phase of the immune response, which is normally characterized by an influx of neutrophils. Thus, 
they help to overcome the time lag between the innate and the adaptive immune responses. In the present study, the influx of DCs, 
neutrophils, and T lymphocytes into the subglottic and glottic mucosae of rats was examined at different time points after challenge 
with a broad spectrum of stimuli such as dead Moraxella catarrhalis, viable Bordetella pertussis, viable Sendai virus, and the soluble 
protein ovalbumin. The number of DCs increased rapidly after the application of the antigens. This increase was as rapid as the 
increase in neutrophils. Depending on the kind of antigen, their number in the mucosa increased up to 1,000 cells per 0.1 mm? 
(Sendai virus). The comparison of different mucosal areas shows that an overwhelming number of immunocompetent cells entered 
the subglottic mucosa, whereas only a few cells migrated into the adjacent glottic mucosa. In conclusion, after inhalation of different 
kinds of antigens, the subset of immunocompetent cells investigated in this study entered the laryngeal mucosa in high numbers. The 
number of DCs entering the laryngeal mucosa was higher than the numbers of the other immune cells investigated. This finding 
underlines their function as first-line sentinels of the mucosal immune system of the larynx. The observation that the number of cells 
entering the laryngeal mucosa is location-dependent indicates the ability of adjacent laryngeal regions to react differently. This is 


similar to the clinical observation of a selective subglottic reaction during acute laryngotracheitis. 


KEY WORDS — acute laryngotracheitis, dendritic cell, mucosal immune system. 


INTRODUCTION 
Infectious laryngotracheitis, so-called pseudo- 


croup, is a disease that preferentially affects the sub- ` 


glottic mucosa.! As new therapeutic strategies are 
discussed, such as active immunization,? it is still 
unknown why there is selective swelling of the sub- 
glottic mucosa and why the glottic mucosa does not 
react in the same. way. One reason for this phenom- 
enon may be that the composition of the mucosal 
immune system differs between the subglottis and 
glottis. In previous studies, we were able to demon- 
strate in healthy young rats that the number of im- 
munocompetent cells in the subglottic mucosa out- 
numbered that in the glottic mucosa.3-4 


The host immune response that occurs after inha- 
lation of antigens can be divided into innate and ac- 
quired immune mechanisms. The first is character- 
ized by the rapid influx of neutrophils in response to 
local chemotactic factors. This step of the innate im- 
mune system is followed by an influx of macrophages 
(adaptive immune mechanism) that take up the anti- 


gen and present it to T lymphocytes (induction of 
the cellular immune response). The latter response 
normally occurs 48 hours after the induction of the 
innate immune response. However, the delayed re- 
cruitment of the adaptive immune response provides 
a time span for the establishment and spread of in- 
haled pathogens that may be dangerous in cases in 
which no memory cells against these pathogens ex- 
ist. Some studies indicate that dendritic cells (DCs) 
function as first-line sentinels in the immune surveil- 
lance of the upper airways and the lung.> They mi- 
grate into the mucosae from a circulation pool of pre- 
cursor cells and take up incoming antigens. After- 
ward, they migrate to peripheral lymph nodes, where 
they present the antigen to T lymphocytes nearly im- 
mediately after the inhalation of a pathogen.° In this 
way, they may prevent the establishment of incom- 
ing pathogens and are the most important subpopu- 
lation for the induction of the adaptive immune re- 
sponse.’ 


To examine the influx of DCs into the mucosa of 
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the larynx, we induced experimental acute laryngo- 
tracheitis in young rats with different antigens. We 
selected ovalbumin (soluble antigen), heat-killed Mo- 
raxella catarrhalis (MC; dead bacteria), viable Bor- 
detella pertussis (BP; viable bacteria), and Sendai 
virus (viable virus), which is structurally and sero- 
logically related to parainfluenzavirus type 1, the 
main pathogen of pseudocroup in humans.! At dif- 
ferent time points after the application of the anti- 
` gens, the numbers of DCs, neutrophils, and T lym- 
phocytes were examined in the subglottic and glot- 
tic mucosae. 


MATERIAL AND METHODS 


Preparation and Application of Moraxella catar- 
rhalis. The MC was grown in Mueller-Hinton broth 
(37°C, 24 hours), centrifuged, and washed 3 times 
before resuspension at 10° colony-forming units 
(CFU) per milliliter. Immediately before aerosoliza- 
tion, the bacterial suspension was heated at 60°C for 
1 hour and passed vigorously through a 26-gauge 
needle 5 times. Afterward, 5-week-old PVG rats (n 
= 20) were exposed to the heat-killed bacteria (10 
mL of the bacterial suspension, by means of an inha- 
lation exposure apparatus from Tri-R Instruments, 
New York, New York; Fig 1). The animals were ex- 
sanguinated 1, 2, 4, 6, and 12 hours after antigen 
exposure. Four animals serving as the control group 
were exposed to sodium chloride 0.9% and exsangui- 
nated after 2 hours. 


Preparation and Application of Viable Bordetella 
pertussis. Viable BP bacteria (strain W28, provided 
by Dr P. Nowotny, Kent, England) were stored at 
—70°C until use. They were incubated for 72 hours 
on Charcol agar (Oxoid Australia, Melbourne, Aus- 


FILTER 


Fig 1. Scheme of Tri-R airborne infection ap- 
paratus. Suspension of antigens is placed in 
nebulizer unit. Air under pressure atomizes sus- 
pension, which is mixed with filtered room air 
(arrows) drawn into system by suction beyond 


+ decontamination system. Animals are placed 
into compartmented mesh basket within cham- 
ber. 

VACUUM PUMP 


tralia) containing cephalexin (40 uL/mL) and 10% 
defibrinated horse blood (State Health Laboratories, 
Perth, Australia). Then they were harvested, washed, 
and resuspended in:3 mL phosphate-buffered saline 
solution (PBS) to a final concentration of 6 x 10° 
bacteria per milliliter. Under ether anesthesia, the la- 
rynges of 16 brown Norway rats were inspected with 
a conventional otoscope, and BP (60 uL) was injected 
directly into the lumen of the larynges of the rats. 
with a 26-gauge cannula. The animals were exsan- ` 


_ guinated 12, 24, 48, and 96 hours after application 


of the viable bacteria. Four control animals were in- 
jected with 60 uL PBS and were exsanguinated 24 
hours later. 


Preparation and Application of Sendai Virus. Sen- 
dai virus (Enders strain’) was propagated in the allan- 
toic cavities of 9-day-old fertilized chicken eggs for 
3 days. The allantoic fluid was stored at —70°C until 
used. The virus was inoculated intranasally in brown 
Norway rats as previously described.? At various time 


-points after application of the virus (2, 5, 7, and 14 


days), the animals (each time point, n = 5) were ex- 
sanguinated under ether anesthesia and the larynx, 
including the first 4 tracheal rings, was removed. Con- 
trol animals (n = 8) were inoculated with allantoic 
fluid alone and exsanguinated at the same time points 
(each time point, n = 2). 


Ovalbumin Application. At the age of 4 weeks, 
PVG rats (n = 25) were immunized intraperitoneally 
by injection of 100 ug ovalbumin (OVA; Sigma, Mu- 
nich, Germany) dissolved in 500 uL PBS (pH 7.2) 
containing 10 mg aluminum hydroxide (Withehall, 
Sydney, Australia). At the age of 6 weeks, 20 ani- 
mals were exposed to aerosolized OVA (10 mg/mL) 
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Fig 2. Numbers (mean + SD) of 
dendritic cells, neutrophils, and T 
lymphocytes in mucosae of con- 
trol animals (time point = 0) and 
at various time points after expo- 
sure to dead bacteria (Moraxella 
catarrhalis; n = 4 at every time 
point), viable bacteria (Bordetella 
pertussis; n = 4 at every time 
point), viable virus (Sendai virus; 
n=5 at every time point), or sol- 
uble protein antigen (ovalbumin; 
n=5 at every time point). If num- 

- ber of cells was less than 5 cells 
per 0.1 mm?, data of animals were 
pooled. Asterisks indicate values 
that are significantly different 
from control values of same re- 
gion (p < .05). A) Subglottic mu- 
cosa. B) Glottic mucosa. 
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for 30 minutes. The animals were exsanguinated 18, 
24, 48, and 72 hours after the aerosolization. Five 
animals serving as a control group were exposed to 
‘aerosolized sodium chloride 0.9%, and they were ex- 
sanguinated after 24 hours. 


ae Preparation of. Specimens and Immunohistochemi- 
cal Analysis. When the larynges were removed, they 
were filled with OCT compound (Miles, Naperville, 
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Indiana) and snap frozen in liquid nitrogen, and cryo- 
sections 6 um thick were cut in a frontal direction. 
Staining of surface antigens was performed as de- 
scribed elsewhere.5-!0 Briefly, the slides were fixed 
in 100% ethanol and incubated for 60 minutes at 
room temperature in a moist chamber with primary 
antibodies (DC Ox62; neutrophilic granulocytes RP3; 
o/B T lymphocytes R73411). Negative staining con- 
trols were performed by omitting the first antibody. 
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Afterward, the slides were incubated with the sec- 
ond antibody (rabbit anti-mouse, Dako, Hamburg, 
Germany) for 60 minutes and with the third antibody 
complex (peroxidase anti-peroxidase, Dako) for 60 
minutes. The color reaction was carried out with ben- 
zidine (Sigma, Munich), and the slides were coun- 
terstained with hematoxylin (Fluka, Buchs, Switzer- 
land). 


Tissue Evaluation and Statistical Analysis. The 
numbers of DCs, neutrophils, and T lymphocytes 
were determined in the subglottic and glottic muco- 
sae with a conventional light microscope at 500x 
magnification as described previously.4+ The cells 
were counted within the epithelium and within the 
subepithelial area by means of a graticule in the eye- 
piece. At least 100 cells of each subpopulation were 
counted in every region per animal, unless stated oth- 
erwise. Means and standard deviations were deter- 
mined (SPSSPC*, version 4.0.1; SPSS Inc, Chicago, 
Illinois). Differences between the subglottic and glot- 
tic areas were tested for significance with the Wil- 
coxon rank sum test. They were considered signifi- 
cant at p < .05. Differences between the time points 
and differences between the numbers of cells in the 
investigated subpopulations were tested for signifi- 
cance with the Mann-Whitney U test and were con- 
sidered significant at p < .05. 


RESULTS 


Influx of Cells Into Subglottic Mucosa. After ad- 
ministration of MC, BP, and Sendai virus, the in- 
fluxes of all cell subpopulations were measured, and 
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Fig 3. Numbers (mean + SD) of 
dendritic cells in subglottic and 
glottic epithelia of control ani- 
mals (time point = 0) and at vari- 
ous time points after exposure to 
dead bacteria (M catarrhalis; n = 
4 at every time point), viable bac- 
teria (B pertussis; n = 4 at every 
time point), viable virus (Sendai 
virus; n = 5 at every time point), 
or soluble protein antigen (oval- 
bumin; n=5 at every time point). 
If number of cells was less than 5 
cells per 0.1 mm?, data of animals 
were pooled. Asterisks indicate 
significant differences between 
numbers of dendritic cells in sub- 
glottic and glottic epithelia (p < 
.05). 
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similar kinetics were observed for each influx (Fig 
2A). However, DCs surpassed neutrophils and T lym- 
phocytes at every time point (p < .01). In every ex- 
periment, the number of DCs was 3 to 4 times higher 
than the numbers of neutrophils and T lymphocytes 
at the peak of the inflammation. An influx of DCs 
and T cells was also observed after the inhalation of 
the soluble protein antigen OVA, but neutrophils were 
not detected here in greater numbers. In comparing 
the four animal models, it became obvious that the 
time course of the cell influx varied among the ex- 
periments. The most rapid influx could be seen after 
inhalation of MC. It started directly after the appli- 
cation of the antigen, and the highest number of cells 
was detected after 2 hours. In contrast, after inocula- 
tion of Sendai virus, the peak of the cell influx was 
detected after 5 days. 


Difference Between Subglottic and Glottic Muco- 
sae. The pattern of the DC influx into the glottic mu- 
cosa (Fig 2B) was quite similar to that into the sub- 
glottic mucosa after application of MC, Sendai virus 
and OVA. Similar to the subglottic mucosa, the num- 
ber of DCs exceeded the numbers of other immuno- 
competent cells (p < .01). After the application of 
BP, the number of immunocompetent cells in the glot- 
tic mucosa did not change. The comparison of the 
glottis and subglottis showed that the numbers of 
DCs, neutrophils, and T lymphocytes were always 
higher in the subglottic mucosa than in the glottic 
mucosa (p < .05). 


Difference Between Glottic and Subglottic Epithe- 
lia. The kinetics of the DC influx were comparable 
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in the subglottic and glottic epithelia (Fig 3). How- 
ever, the number of DCs in the epithelium was al- 
ways higher than that in the subepithelial compart- 
ment (p < .01). As a result, the cell number calcu- 
lated in the epithelium was relatively higher than the 
cell number calculated for the whole mucosa (Fig 
2). In all experiments, as many as 4 times more DCs 
were detected in the subglottic epithelium than in 
the adjacent glottic epithelium. This tendency was 
also true for neutrophils and T lymphocytes. 


DISCUSSION 


Pseudocroup is a disease of young children.! Pa- 
tients suffer from whooping cough and respiratory 
distress that is caused by selective swelling of the 
subglottic mucosa after the inhalation of pathogens. 
In contrast, the adjacent glottic folds remain almost 
unaffected. The mechanisms underlying the selec- 
tive swelling of the subglottic mucosa are unknown. 
There may be a difference in the immune reaction 
between the two laryngeal areas. The immune reac- 
tion following inhalation of pathogens can be divided 
into 2 steps. Immediately after the inhalation of path- 
ogens, an influx of neutrophils (innate immune re- 
sponse) is observed. Later, commonly after 48 hours, 
macrophages migrate into the inflamed mucosa and 
take up antigens and present them to T lymphocytes 
(adaptive immune response). In this model, the adap- 
tive immune response serves as an optional backup 
to the innate host defense in situations in which high 
numbers of antigens persist after the influx of neu- 
trophils. However, the delayed recruitment of adap- 
tive immune mechanisms provides a time span for 
pathogens to spread in the inflamed tissue. It is known 
that DCs are the most capable subpopulation that 
takes up antigens, processes them, and presents them 
to T lymphocytes.” They derive from a circulating 
monocytelike precursor pool, migrate through airway 
mucosa, and take up antigens shortly after they pene- 
trate. Thus, DCs may close the time lag between in- 
nate and adaptive immune responses by triggering a 
T cell response immediately after inhalation of the 
antigen. The influx of DCs, neutrophils, and T lym- 
phocytes provides information about the course of 
the immune reaction in the laryngeal mucosa and was 
studied during experimental acute laryngotracheitis 
in young rats. Acute laryngotracheitis was induced 
by a broad spectrum of stimuli that were known to 
cause an influx of immunocompetent cells into the 
mucosa of the trachea.!2 These included heat-killed 
MC (dead bacteria), which is known to be a strong 
stimulus for the induction of experimental airway in- 
fection® and which is also known to induce acute lar- 
yngotracheitis.13 Furthermore, an acute inflamma- 
tion was induced by the intralaryngeal application 


of viable BP. As a viable virus model, Sendai virus 
was given by intranasal inoculation followed by con- 
stant inhalation of the pathogen. Sendai virus is struc- 
turally and serologically related to parainfluenza vi- 
tus type 1, the main pathogen of pseudocroup in hu- 
mans,! and therefore, this model closely reflects the 
physiological situation in children. Finally, the ani- 
mals inhaled soluble protein antigen (OVA). The 
heat-killed MC, as well as OVA, was inhaled from a 
specially designed machine that was not used for the 
application of viable pathogens, in order to avoid con- 
tamination. Our results show that many DCs, neutro- 
phils, and T lymphocytes migrated into the laryngeal 
mucosa during the experiments. However, at every 
time point, the number of DCs exceeded the num- 
bers of the other subpopulations. This finding empha- 
sizes the extraordinary position of DCs in the muco- 
sal immune system of the larynx. More important, 
after the application of dead bacteria, as well as after 
the application of live bacteria and the virus, the num- 
ber of DCs increased as fast as the number of neu- 
trophils. Even in the very early phase of local inflam- 
mation, not only phagocytosis of invading pathogens 
by neutrophils, but also the initiation of a specific 
immune response, is of importance. This scenario is 
supported by our observations. The DC numbers also 
increased in the epithelium of the laryngeal mucosa, 
a location in which invading pathogens are found in 
high numbers very early in the course of infection. 
Furthermore, the involvement of the specific immune 
system even at very early time points during infec- 
tion was also indicated by the increase of T cells both 
subepithelially and within the epithelium. 


The kinetics of the cell influx differed among the 
4 experiments. The inhalation of dead MC and OVA 
led to the highest numbers of DCs, neutrophils, and 
T lymphocytes in the subglottic mucosa after 2 or 18 
hours. The peak cell count after application of BP 
and Sendai virus was detected after 24 hours or 5 
days. This different time course may be due to the 
mode of application (inhalation versus inoculation). 
The nucleoprotein of Sendai virus,® which was ap- 
plied intranasally, was detected in high amounts on 
the laryngeal epithelium only after 2 days (data not 
shown). In order to induce a full immune response, 
the virus has to damage the epithelium. This process 
requires more time than the direct application of great 
amounts of antigen, eg, heat-killed MC. 


The influx of immunocompetent cells into the la- 
ryngeal mucosa seems to be region-specific. In pre- 
vious experiments, we were able to demonstrate that 
the number of immunocompetent cells within the 
subglottic mucosa in healthy rats is higher than that 
in the glottic mucosa.?4 As our present experiments 
indicate, this difference between the compartments 
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increases after exposure to different kinds of antigens, 
although the patterns of cellular changes in the sub- 
glottic and glottic mucosae remain similar. Such an 
observation (strong subglottic reaction versus weak 
glottic reaction) is frequently made during acute hu- 
man laryngotracheitis. The present study shows that 
as many as 4 times more DCs, neutrophils, and T 
lymphocytes enter the subglottic mucosa than enter 
the glottic mucosa. Because we determined the num- 
ber of cells per area, and the actual volumes of the 
glottic and subglottic areas are unknown, we cannot 
provide the absolute number of cells entering the two 
compartments. However, the data indicate that dur- 
ing infection, the relative increase in the number of 
DCs, neutrophils, and T lymphocytes is much higher 
in the subglottic area than in the glottic area. Such a 
big difference of cellular influx into the mucosa of 
two directly adjacent locations is surprising, and may 
have more than one cause. It may be caused by dif- 
ferent adhesions of antigen on the epithelia of the re- 
gions as a result of different properties of the epithe- 
lia. Furthermore, the inflammatory response might 
be specifically regulated in the respective microen- 


vironments of the compartments by mechanisms that 
are so far unknown. However, a lower influx of DCs, 
neutrophils, and T lymphocytes in the glottic area 
may protect this region from the adverse effects of a 
strong immune response. In contrast, the high num- 
ber of immunocompetent cells in the subglottic area 
indicates a strong immune reaction. Although we as- 
sume that a strong immune reaction in the subglottic 
area leads to mucosal swelling, we did not prove this 
assumption by exact measurements. 


In summary, our data show that DCs are recruited 
into the subglottic and the glottic mucosae during the 
inflammatory response to a broad spectrum of stimuli 
such as dead bacteria, viable bacteria, viable virus, 
and soluble protein antigen. The data also show that 
DCs migrate into the inflamed mucosa in the early 
phase of the immune response. However, the subglot- 
tic and glottic mucosae, which are located next to 
each other, show significant differences in their re- 
sponsiveness during infection. Whether this pattern 
is also reflected by mucosal swelling still needs to 
be investigated in future studies. 
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MORPHOLOGY OF THE CRICOPHARYNGEAL MUSCLE IN ZENKER 
AND CONTROL SPECIMENS 
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The cricopharyngeal muscle (CPM) is essential for normal deglutition. Pharyngeal dysphagia commonly results from impaired 
or uncoordinated CPM dilation. Dysfunction of the CPM has also been implicated in the genesis of Zenker’s (pharyngoesophageal) 
diverticulum. Despite the CPM’s significance, little is understood about its morphology. We studied CPM biopsy specimens from 20 
patients with Zenker’s diverticulum and from 5 fresh cadaver patients with detailed histologic techniques to include fiber size and 
shape and adenosine triphosphatase, reduced nicotinamide adenine dinucleotide, trichrome, succinate dehydrogenase, cytochrome C 
oxidase, periodic acid—Schiff reaction, oil red O, acid phosphatase, Congo red, crystal violet, and monoadenylate deaminase stains. 
The normal CPM has unique morphological characteristics, with some myofibers having staining properties that are a hybrid between 
striated muscle and muscle spindle. The variable orientation of the muscle fibers is also different from that of most other striated 
musculature. Of the 20 Zenker CPM specimens, 4 specimens did not reveal any significant differences from controls (2 of which had 
insufficient amounts of tissue for complete analysis). In the remaining 16 specimens, several abnormalities existed, including excessive 
size variation (16/16), grouping of atrophic fibers (9/16), target or targetoid formations (4/16), cores (2/16), and ragged red fibers (2/ 
16). The final pathological pattern of the 16 specimens was neurogenic in 7, myopathic in 4, and mixed (with neurogenic predominance) 
in the remaining 5. Two specimens contained significant lymphocytic inflammatory infiltrates. We conclude that the unique 
neuromuscular function of the CPM in deglutition is likely due to its fiber orientation and the hybrid nature of some of the myofibers. 
Morphological disturbances of the CPM impair its dilation and may account for the development of Zenker’s diverticulum. This 
disturbance is most often due to progressive denervation of the CPM. 


KEY WORDS — deglutition, esophageal diverticulum, neuromuscular disease, cricopharyngeal muscle, pharynx, Zenker’s 


diverticulum. 
INTRODUCTION 


Pharyngoesophageal (Zenker’s) diverticula consist 
of a protrusion of pharyngeal mucosa between the 
inferior constrictor muscle and the cricopharyngeal 
muscle (CPM), a dehiscence identified in 1908 by 
Killian.! The cause of these diverticula is considered 
to be mechanical or pulsion-related, associated with 
poorly coordinated or incomplete dilation of the 
CPM.” The.CPM may be dysfunctional because of 
neurogenic changes, myopathic changes, or both. Few 
studies have focused on normal CPM histologic find- 
ings, and even fewer have investigated the CPM his- 
topathologic findings associated with Zenker’s di- 
verticulum. Furthermore, the studies currently avail- 
able demonstrate inconclusive morphological CPM 
findings. The aim of this study was to determine nor- 
mal CPM histologic findings and the histopathologic 
findings associated with Zenker’s diverticulum. With 
this knowledge, it may be possible to gain a better 
understanding of the cause of Zenker’s diverticulum, 
which could help target future management and pre- 
vention strategies. 





METHODS 


The measurements and observations were made 
on a total of 25 specimens. The CPM biopsy speci- 
mens were harvested from 20 patients with Zenker’s 
diverticulum and 5 fresh (<12 hours) cadavers of sub- 
jects with no known pharyngoesophageal dysfunc- 
tion. All specimens were analyzed by interpreters (au- 
thors S.K.D. and S.S.J.) blinded to patient history 
and indication for CPM biopsy. 


The patients consisted of 14 men and 6 women 39 
to 92 years of age (mean, 71 years) who presented to 
our institution for surgical management of Zenker’s 
diverticulum (diverticulotomy and CPM myotomy) 
from December 1993 through November 1999. Three 
patients had recurrent Zenker’s diverticulum after un- 
dergoing previous diverticulotomy without CPM my- 
otomy at an outside institution. The cadavers from 
whom the control specimens were harvested included 
3 men and 2 women with an age range from 64 to 80 
years (mean, 67 years). None of the patients or cadav- 
ers had a history of neck irradiation. 
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= TABLE 1. STUDY RESULTS 








Size Ragged Red 
CPM Specimens Variation Fibers 
Zenker 16/18 2/18 
Control 0/5 0/5 


Atrophic 


Grouping 








Target or 
Fiber Targetoid Inflammatory Fiber 
Formations Infiltrate Size (um) 
9/18 4/18 2/18 50 
0/5 2/5 0/5 4] 


Numerators indicate number of cricopharyngeal muscle (CPM) specimens with above findings; denominators indicate total number of specimens 


in given category. 





Muscle biopsy tissue was received fresh, oriented 
in cross section, and immediately snap-frozen in iso- 
pentane cooled in liquid nitrogen. Tissue blocks were 
stored at —70°C until processed for histochemical 
analysis. 

Muscle tissue was cryosectioned at —20°C on a 
Leica 1800 cryostat (Jung Histology Products, Nuss- 
loch, Germany). All tissue sections were picked up 
at a thickness of 8 um, and the following histochemi- 
cal stains? were used: hematoxylin and eosin for mor- 
phological changes; Gomori modified trichrome, suc- 
cinate dehydrogenase, and cytochrome C oxidase for 
assessment of mitochondrial activity; myosin adeno- 
sine triphosphatase (ATPase; pH 9.4, 4.6, and 4.3) 
for fiber typing analysis; periodic acid—Schiff for gly- 
cogen; oil red O for lipid content; acid phosphatase 
for inflammatory activity; and Congo red and crystal 
violet stains for the presence of amyloid deposition. 

All histologic stains were visualized for diagnostic 
purposes with a BX40 Olympus microscope (Leeds 
Instruments, Minneapolis, Minnesota). Stained sec- 
tions were permanently imaged and photographed 
with a 3-chip color charge-coupled device camera 
(Dage MTI, Inc, Michigan City, Indiana) in conjunc- 
tion with Adobe Photoshop 4.0 (Adobe Systems Inc, 
San Jose, California). 


RESULTS 
All CPM specimens (Zenker and controls) had 





wide variation of shape within specimens, including 
round, angulated, and polygonal fibers. On analysis 
of the 20 Zenker-associated CPM specimens, 2 spec- 
imens had insufficient amounts of tissue for analysis 
and 2 other specimens showed no significant differ- 
ence from controls. One of the specimens without 
significant difference from controls was from a 39- 
year-old patient (the youngest patient in the group); 
the other specimen was from a 68-year-old patient. 


The remaining 16 specimens all showed excessive 
size variation. The Zenker specimens also demon- 
strated other histopathology, as demonstrated in Table 
1. Fiber histopathologic findings were classified as 
neurogenic in specimens on the basis of findings of 
increased fiber variation, angulated shape (Fig 1), 
pyknotic fibers (Fig 2), fiber grouping (Fig 3), target 
or targetoid formation (Fig 4), and excessive numbers 
of type IIB fibers. Myopathic changes were deter- 
mined by various factors, including the presence of 
ragged red fibers (Fig 5), inclusion bodies, and exces- 
sive inflammatory infiltrate with associated fiber de- 
generation (Fig 6). The final pathological pattern of 
the abnormal 16 specimens was neurogenic in 7, 
myopathic in 4, and mixed in the remaining 5. 


The fiber arrangement in the CPM was unique in 
that the majority of fibers were interwoven at near 
90° angles to one another. Thus, attempts to obtain 
cross sections resulted in a combination of transverse 





Fig 1. Hematoxylin and eosin stain at (original) 20x mag- 
nification demonstrates high variability of fiber size and 
angulation of fibers. 


Fig 2. Hematoxylin and eosin stain at (original) 40x mag- 
nification depicts pyknotic nuclear clump (arrowhead), 
which represents nuclei of degenerated fibers. 
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Fig 3. Adenosine triphosphatase stain at pH 4.6 dem- 
onstrates fiber types I, Ia, Hb, and IIc (see labels), which 
are differentiated according to stain. Many type I fibers 
demonstrate grouping (fiber[s] surrounded by same fiber 
types), which is representative of denervation-reinner- 
vation phenomenon. 


and longitudinal fibers in every direction in which 
the specimens were cut (Fig 7). The fibers appeared 
to be embedded in a connective tissue framework. 


A small but significant number of longitudinal fi- 
bers demonstrated a unique zebra-staining pattern (Fig 
8). On cross section, these appeared to be grouped, 
with 4 darkly staining larger fibers and 2 light-staining 
fibers with less than half the cross-sectional diameter 
of the larger fibers. They were not encapsulated. 


DISCUSSION 


When dilation of the CPM is impaired, the final 
phase of oropharyngeal swallowing results in in- 
creased hypopharyngeal bolus pressure. If the CPM 
will not adequately dilate, the bolus pressure will be 
transferred to the area of least resistance, ie, Killian’s 
triangle. The repeated action of increased pressure 





Fig 4. Reduced nicotinamide adenine dinucleotide stain 
depicts target formation (arrowhead), which is resultant 
from denervation-reinnervation phenomenon. 


an 
a 


Fig 5. Cytochrome C oxidase at (original) 60x magnifica 
tion shows ragged red fibers, which are abnormal dense 
accumulations of mitochondria. 


will cause protrusion of hypopharyngeal mucosa into 
the retropharyngeal space and ultimately result in a 
Zenker’s diverticulum. It is well established that par- 
tial CPM myectomy will greatly relieve dysphagia 
in patients with diverticula. Dilation is permanent 
after the myectomy, and the bolus will pass unevent- 
fully through the cricopharyngeal area. 


The CPM has previously been described as a stri- 
ated muscle in a dense connective tissue framework." 
Excessive fiber size variation in normal CPMs has 
been attributed to the fusiform nature of the muscle 
fibers. This study failed to demonstrate excessive fi- 
ber size variation in healthy CPM specimens. How- 
ever, because the fiber vectors were frequently at sig- 
nificant (up to 90°) angles to one another, the longitu- 
dinal sections and cross sections of fibers on the same 
slide often gave an illusion of excessive variation. 

Histologically, the CPM consists of predominantly 
type I muscle fibers with highly oxidative activity, 





Fig 6. Hematoxylin and eosin at (original) 20x magnifica- 
tion shows increased amounts of connective tissue, in 
flammation, and basophilic fibers. 
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Fig 7. Adenosine triphosphatase stain at pH 4.3 demon- 
strates unique orientation of fibers. 





and no type IIA fibers.> Another unique characteris- 
tic of the CPM is its unusual type I ATPase staining 
properties. Whereas most type I muscle fibers stain 
light in an alkaline pH of 9.4, CPM type I fibers do 
not lighten until the pH is raised to 10.4.5 The type I 
ATPase staining properties are thought to be due to 
the highly oxidative character of the fibers. The unique 
staining characteristics of the CPM are consistent with 
primitive nondifferentiated fetal musculature and 
branchial arch muscles, and not other adult striated 
musculature as reported in prior studies.*67 Previ- 
ous studies have also reported normal CPM and other 
laryngeal musculature to have increased subsarcolem- 
mal staining and ragged red fibers57 (Table 22-8). 
Our experience suggests that these features are char- 


© 





Zenker CPM 






TABLE 2. PREVIOUS CPM STUDIES 


\ 
“a 


Fig 8. Adenosine triphosphatase stain at pH 4.3 demon- 
strates zebra or tigroid appearance of some longitudinal 
fibers. 


acteristic of mitochondrial myopathy and are unlikely 
to represent a truly normal finding. 

Other unique features of the CPM are the pres- 
ence of alternating staining patterns within several 
longitudinal fiber sections and the grouping of these 
fibers in a unique pattern. The histologic appearance 
of these CPM fibers imitates muscle spindles. Al- 
though previous studies have not found muscle spin- 
dles in the CPM, perhaps this is because some of the 
CPM fibers themselves are spindlelike in nature. This 
could explain why CPM dilation at the end of oropha- 
ryngeal swallow may be an active rather than a pas- 
sive act. The “stretch” stimulus of a bolus passing 
downward may induce active relaxation followed by 











Control or Normal CPM Specimens 





Cruse et al? 


Lerut et alë 


Bonington et al 





Kristmundsdottir et al’ 





Brownlow et al? 


Lerut et alë 


Fiber size 25.5 um 40.3 um 28 um 
Fiber size variation 40/41 Found in many Found in many 2/15 
areas areas 
Fiber predominance Type I Type I (85.9%) Type I (82%) Type I Type | 
(70%) 
Reduced nicotinamide Strong staining Strong staining 
adenine of all fiber types of all fiber types 
dinucleotide 
Ragged red fibers 23/41 Found in many 2/15 
areas 
Fibrotic tissue Found in few 31/41 2/15 
areas 
Nemaline rods 4/41 Found in many 0/15 
areas 
Central nuclei 30/41 Found in few 8/21 1/15 
areas 
Inflammatory 21/41 Found in many Found in few 0/15 
infiltrate areas areas 
Subsarcolemmal Found in few Found in many 
crescents areas areas 


Blank areas indicate lack of data. Exact numbers are reported when available (versus total numbers of specimens). 
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a contraction similar to that induced by stretching 
muscle spindles after fusimotor efferent stimulation. 
Similar sphincteric activity occurs elsewhere, in the 
urethral and rectal dilatory actions,? but no known 
studies have previously reported muscle spindle ana- 
logs in CPM or other sphincteric musculature. 


The findings in this study are to a large extent in 
agreement with those of previous authors (Table 2), 
most specifically with those of Lerut et al,8 who were 
the only previous investigators to analyze Zenker- 
associated CPM specimens with a sample of more 
than 10. In addition to the findings in Table 2, Lerut 
et al demonstrated weak or absent acetylcholinester- 
ase and neurofilament staining in over 75% of Zenker- 
associated CPM specimens. These findings are most 
often suggestive of neurogenic disorder. Fiber type 
grouping and increased size variation are also sugges- 
tive of neurogenic deficit, and both Lerut et al and 
this study demonstrated these in the majority of Zen- 
ker-associated CPM specimens. Despite these find- 
ings, Lerut et al did not classify the histopathologic 
findings as neurogenic because of the significant myo- 
pathic changes that often coexisted in the specimens. 
These myogenic changes include necrosis, central 
nuclei, nemaline rods, ragged red fibers, and inflam- 
mation. 


This study’s finding of neurogenic degeneration 
in 12 of 16 CPM specimens is significant. In con- 
trast to the study of Lerut et al,’ myopathic changes 
were found to be less common. Of the 12 specimens 
with neurogenic degeneration, changes suggestive of 
both myopathy and denervation coexisted in 5 speci- 
mens; myopathic changes alone were found in 4 spec- 
imens. It is possible that myopathic changes were 
secondary to denervation, or both the myopathic and 
neurogenic findings were in response to a common 
insult. It is also possible that the myopathic changes 
were secondary to mechanical forces or pressure from 
the overlying diverticulum. In contrast, neurogenic 
changes would not likely be explained by such a 
mechanism. Thus, with the majority of specimens in 
both previous studies*8 and the current study demon- 
strating evidence of neurogenic deficit, one can con- 
clude that neurologic deficit of the CPM is a major 
etiologic factor in many cases of Zenker’s diverticu- 
lum. 


The anatomic implication of a neurogenic cause 

` remains vague, as the exact innervation of the CPM 
is still uncertain. While the vagus nerve is thought to 
innervate the CPM in some manner, the exact path- 
way is yet unclear. Three reports strongly support 


innervation from the recurrent laryngeal nerve 
(RLN).!°12 Netter, !3 in his 1959 anatomy text, indi- 
cates that the superior laryngeal nerve (SLN) inner- 
vates the CPM. His 1997 anatomy text, edited by a 
renowned practicing anatomist, continues to depict 
this relationship.!4 In a recent anatomic dissection 
presentation, a branch of the external SLN extended 
to the CPM (R. J. Toohill et al, unpublished observa- 
tions). This branch is much larger than the nerve that 
extends to and innervates the cricothyroid muscle. 
This branch was previously identified, but was la- 
beled as a “communicating” branch between the SLN 
and the RLN (I. Sanders, unpublished observations). 
Future efforts to definitively identify the innervation 
of the CPM and the denervation associated with 
Zenker’s diverticulum may lead to better treatment 
and prevention modalities. 


One limitation of this study is the lack of power in 
the sample size. Certain findings, such as 2 of our 
Zenker specimens’ being no different from controls, 
may thus be due to chance. Interestingly, 1 of the pa- 
tients with a specimen no different from controls was 
the youngest patient among the Zenker and control 
groups, and significantly younger than the average 
(70 to 90 years old) Zenker patient. Although histo- 
logic changes of the CPM that occur with age have 
not-been well established, they clearly could be a 
confounding factor. 


This study was also limited in the interpretation 
of the causation of the myopathic and mixed changes 
found in some specimens. If the myopathic changes 
found are a result of Zenker-induced mechanical ir- 
ritation, such as a direct pressure effect from an over- 
lying diverticulum, a relationship may be established 
between the severity of myopathic changes and pre- 
operative diverticulum duration and/or diverticulum 
size. Further studies are necessary to investigate such 
a relationship. Future studies would also be useful 
in correlating histopathologic findings with physio- 
logic testing methods such as electromyography, thus 
defining the relationship between myopathic and neu- 
rogenic changes seen in mixed specimens. 


CONCLUSION 


This study demonstrates the CPM to have fibers 
with unique orientation and some myofibers of a hy- 
brid nature, which likely contribute to the unique func- 
tion of the CPM with deglutition. Morphological dis- 
turbances, which are most often due to progressive 
denervation, impair CPM dilation and may account 
for the development of Zenker’s diverticulum. 
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Dynamic remodeling of neuromuscular junction (NMJ) structure is postulated as a cause of age-related muscular atrophy. Direct 
study of NMJ morphology in laryngeal muscles is important to our understanding of age-related decrements in voice and swallow- 
ing. The morphology of NMJs was studied in a rat model to compare young and old specimens of thyroarytenoid muscle — a muscle 
critical to airway protection and phonation. Fluorescent, triple-label immunohistochemical analysis and confocal microscopic visu- 
alization were used to analyze the structure of NMJs. We found that laryngeal NMJs underwent significant changes that were similar 
to those observed after denervation. Specifically, the axon terminal area was significantly reduced, there were a number of postsyn- 
aptic acetylcholine receptor areas unoccupied by nerve terminals, and there was increased variability in end plate architecture in the 
old muscles. The results of this study increase our understanding of the age-related morphological changes in the larynx, and may 
serve as a baseline to test the effectiveness of future interventions. 


KEY WORDS — aging, confocal microscopy, neuromuscular junction, thyroarytenoid muscle. 


INTRODUCTION 


Fatigue and weakness are common complaints of 
elderly persons. These symptoms are often related 
to sarcopenia, or loss of skeletal muscle mass, orga- 
nization, and strength. Structurally, sarcopenia is as- 
sociated with a reduction in muscle mass and cross- 
sectional area, a reduction in the number or size of 
muscle fibers, and a transformation of muscle fiber 
types.!-4 Such signs of muscle atrophy begin at 25 
years of age in humans and accelerate with increas- 
ing age,> such that limb muscles in older men and 
women are 25% to 35% smaller than those in young 
adults.2 Denervation-reinnervation is postulated as 
a major cause of sarcopenia,!° as aging undoubtedly 
involves a dynamic revision of muscles, nerves, and 
their connections.®7 


At the neuromuscular junction (NMJ), the hall- 
marks of denervation-reinnervation are sprouting 
(thin projections from the axons, nodes of Ranvier, 
preterminal branches, or nerve terminals) and axonal 
withdrawal, which are features also noted in aged 
muscle.®8 In denervated muscle, thin projections 





from terminal Schwann cells often precede and guide 
axon terminal sprouting.?:!9 Postsynaptic changes 
may be manifested in aged and denervated NMJs as 
the fragmentation of acetylcholine (ACh) receptor 
structure.8-!! These morphological changes in aged 
and partially denervated muscle may have physio- 
logical consequences, such as a diminished ability 
to sustain synaptic transmission!-!3 and reduced force 
generation. !4 Therefore, structural changes at the NMJ 
may be important mechanisms underlying sarcopenia 
in the aged. 


Sarcopenia affects voice and swallowing.!5-!9 
However, little is known about the underlying patho- 
physiology of senescent muscles in the upper aero- 
digestive tract, because most work has been per- 
formed in limb muscles. Because aging does not af- 
fect all muscles or muscle fiber types equally,” it is 
necessary to study cranial muscles directly to dis- 
cern age-related changes that may affect critical func- 
tions of speech and swallowing. 


The thyroarytenoid (TA) muscle contributes to the 
generation of vocal fold tension during phonation and 
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is active during the glottic closure associated with 
the pharyngeal phase of the swallow.”! Accordingly, 
this intrinsic muscle of the larynx is an important com- 
ponent of both voice production and airway protec- 
tion, Age-related changes in muscle fiber type,22.23 
overall innervation structure,24 contractile func- 
tion,?5.26 and kinematics?’ have been suggested by 
prior work in the TA muscle. Collectively, these data 
are highly suggestive of underlying changes in NMJ 
morphology. The hypothesis underlying this research 
is that age-related alterations in upper aerodigestive 
tract function are based upon naturally occurring de- 
nervation-reinnervation processes. The goal of this 
study was to evaluate NMJ morphological differences 
in young and old rat TA muscles and to determine 
whether alterations suggestive of partial denervation 
are found in the old muscle. 


METHODS 


Muscle Extraction. A rat model was used in these 
experiments, because denervation-induced altera- 
tions in rat TA structure and physiology have been 


. well described.28-3! Thyroarytenoid muscles were 


studied in 7 young adult (9 months old) and 5 older 
(24 months old) rats of the F1 hybrid (a cross of Fis- 
cher 344 and brown Norway) obtained from a Na- 
tional Institute on Aging animal colony. The 5 older 
rats and 5 of the young rats were anesthetized with 
an intraperitoneal injection of a mixture of ketamine 
hydrochloride (90 mg/kg) and xylazine hydrochlo- 
ride (9 mg/kg) and euthanized via intracardiac injec- 
tion of 7.2% chloral hydrate. Whole laryngeal speci- 
mens were extracted immediately after euthanasia, 
and the TA muscles were identified and extracted. In 
lactated Ringer’s solution, connective tissue was re- 
moved from the TA muscle with microscissors un- 
der a surgical microscope. . 


As a- comparison, the 2 remaining young animals 
served as denervation controls. These rats were anes- 
thetized, the larynx was exposed, and the recurrent 
laryngeal nerve on the left side was crushed with for- 
ceps. The rats were allowed to recover and returned 
to the animal care facility. Four weeks later, these 2 
animals were again anesthetized and left-sided laryn- 
geal paralysis was confirmed by a rat microlaryngo- 
scopic technique developed in our laboratory.?? Brief- 
ly, the anesthetized animal was placed into a malle- 
able lead operating platform in a near-vertical posi- 
tion with the mouth held open. A custom-made wire 
laryngoscope was then inserted into the mouth, and 
the animal’s position was adjusted to maximize the 
view of the vocal folds under a Zeiss surgical micro- 
scope. After unilateral vocal fold paralysis was visu- 
ally confirmed, the animals were euthanized as de- 
scribed above. Whole laryngeal specimens were har- 


vested, and the TA muscles were identified and ex- 
tracted. We included these denervation controls to 
provide evidence for NMJ alteration in the TA muscle 
as a function of a known denervation state, rather 
than for formal statistical comparisons with either 
the young or old groups. 


Immunohistochemistry. Triple labeling of NMJ 
components was performed.32 The muscles were 
fixed in 4% paraformaldehyde (pH 7.4) for 20 min- 
utes at room temperature and then rinsed 3 times for 
10 minutes in phosphate-buffered saline solution 
(PBS). The muscles were then stained with 1% a- 
bungarotoxin (BTX) conjugated to rhodamine (Mo- 
lecular Probes Inc) for 15 minutes at room tempera- 
ture. The BTX is highly selective for ACh receptors. 
After rinsing with PBS, the muscles were incubated 
in methanol at —20°C for 5 minutes and then rinsed 
again in PBS. The muscles were then blocked with 
PBS containing 0.2% Triton X-100, 2% bovine se- 
rum albumin, and 0.1% sodium azide for 1 hour. The 
following primary antibodies were diluted in block- 
ing buffer: 1:200 anti-neurofilament (SMI 31, mouse 
IgG1, Sternberger, Lutherville, Maryland), 1:100 an- 
ti-synaptic vesicle (SV2, mouse IgG1, Dev. Studies 
Hybridoma Bank, Iowa City, Iowa), and 1:250 anti- 
$100 for the Schwann cells (S100, rabbit IgG, Dako, 
Carpinteria, California). These antibodies were ap- 
plied overnight at room temperature. On the second 
day, the muscles were rinsed, and secondary antibod- 
ies were diluted in blocking buffer and applied for 4 
hours at room temperature. The secondary antibod- 
ies included 1:100 CY5 conjugated to goat anti-mouse 
(H+L, Jackson Immunoresearch Laboratories, West 
Grove, Pennsylvania) and 1:100 FITC conjugated to 
donkey anti-rabbit (H+L, Jackson Immunoresearch 
Laboratories). The muscles were rinsed again and 
then mounted on slides, covered with Vectashield 
mounting medium (Vector Laboratories, Burlingame, 
California) and a coverslip, and sealed. The whole 
muscle specimens were typically 70 to 100 um thick. 
The slides were stored in the dark at 4°C to preserve 
fluorescence. 


Confocal Microscopy. Confocal microscopy was 
performed on the whole muscle samples with a Bio- 
Rad MRC1024 confocal imaging system mounted 
on a Nikon Eclipse inverted microscope. With a Z- 
axis focus motor and a 60x oil-immersion objective, 
the muscles were optically sectioned by scanning 
through the depth of the muscle until clear 2-dimen- 
sional color projection images of axons, receptor 
clusters, and axon terminals were produced. The full 
width of the muscle specimens was examined from 
top to bottom and edge to edge without knowledge 
of animal age group or denervation status; thus, the 
microscopist was blinded to animal group status. Dig- 
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Fig 1. Normal confocal digital images of neuromuscular junctions (NMJs) from 2 different young thyroarytenoid 

cles shown at different magnifications. Receptor clusters appear in red, axons and synaptic vesicles in blue, and S n l 
and nerve terminals in green. Scale bars are in micrometers. A) Original x108. B) Original x180. Colors wer fi 
printing to reflect those observed on computer monitor during data analysis. 








ital images of 30 unique NMJs per muscle at 180x NMJs from young muscles were typically characte1 
were collected. For each animal, the 30 NMJs repre- ized by well-defined receptor clusters and terminal 
sented the first superficial NMJs detected in a mi- regions. A positive linear relationship was observes 
croscopy session that were relatively en face with between terminal size and receptor cluster size in both 
the confocal microscope. the young and old groups (regression slope = 0.26; | 
: oe = .002). The log-log slope of 0.26 suggested that re 
Measurement. An investigator (G.K.S.) blinded to 8- 08 5 Ol R i 
; f = ; ; ceptor cluster area did not change as rapidly as tet 
animal group status, who was also not involved in : ee np : 
es ; f i minal area. Specifically, receptor cluster area changed 
the animal surgeries or the confocal microscopy, per- 2? 
Se . l ; eo ea i at only about the quartic root of the rate of terminal 
formed computer-assisted image analysis of the digi- A : Sr 
: ae = area. This was likely an allometric effect due to the 
tal micrographs (Photoshop 5.5). The following mea- cing Bes. ve 
= À Bi a different geometries of the two structures 
surements were made of superficial NMJs: 1) axon ss 
amansi araa lennita Pion Staves pane ey ee ; : : 
terminal area (square micrometers); 2) receptor clus Representative regression plots of the association 
e areas `; ar Ae atare. 2 ‘tr ` A SP g- " a = f 3 
ter area (square micrometers); 3) structural degrada of terminal and receptor cluster areas for | animal in 
tion of receptor clusters (percent); 4) nerve terminal each of the young, old, and denervated groups at 
withdraws al from receptor clusters (percent): and 5) found in Fig 2. It can be observed that a progressive 
terminal Schwann cell processes, which are precur- increase in variability about the regression line is ap 
sors to axon terminal sprouts (percent). parent in the old and denervated animals relative to 
m i a 3 A 5 T) ani “ Ihe ariahili cq i d 
Statistical analyses were performed by analysis of the young animal. W hen val lability was quantified 
variance with SAS General Linear Models Proce- as the standard deviation of the residual distance, o1 
dure (SAS Institute, Inc, Cary, North Carolina). Pair- root mean square error, significé ntly greate iabi 
; : : ‘as > A group than ın the young 
wise comparisons of the young and old groups were AN e Longa E ins oe LA group than n ung 
then performed by t-test comparisons of population (p = .02). Accordingly, receptor cluster area was sul 
: antially re variable > rats than th g 
marginal means (least squares means). stantially more variable in the old rats than baits 
rats for a given terminal area. The intercepts and 
RESULTS slopes of the relationship between terminal area and 
receptor cluster area were not significant! erent 
Axons, nerve terminals, and receptor clusters were between the young and old groups (p > .05) 
easily identified in the rat TA muscles as widely dis- 
tributed throughout the length and width of the mus- A significant reduction in axon termina 1 was 


cle. As suggested by the NMJ images in Fig 1, the found in the old muscle samples relative to the young 
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Fig 2. Relationship between receptor cluster area and ter- 
minal area is shown for representative animals in young, 
old, and denervated groups (units are logi0 of area in 
square micrometers). There is increased variability about 
regression line in old animal, which is also found in de- 
nervated animal, as compared to young animal. 


samples (p = .02; Fig 3), and the reduction in the 
size of aged receptor cluster areas approached sig- 
nificance (p = .057; Fig 3). In contrast to the young 
animals, receptor clusters in the older animals often 
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Fig 3. Nerve terminal area was significantly reduced (p 
= .02) in older group of animals, and smaller receptor 
area in older animals approached significance (p = .057). 
Error bars represent standard deviations. 


appeared diffuse, distributed over a large area, an- 
nular in character, or punctate (Fig 4). However, as 
shown in Fig 5, these qualitative observations of re- 
ceptor cluster degradation were not significantly dif- 
ferent in the young animal group versus the old ani- 
mal group. As also documented in Fig 5, nerve termi- 
nal withdrawal from receptors was significantly more 
frequent in the old muscle group (p = .007), and there 
was a tendency for more terminal Schwann cell pro- 
cesses among the old muscles, but this comparison 
did not reach statistical significance. A digital micro- 
graph of an old NMJ with terminal Schwann cell pro- 
jections and axonal sprouting is shown in Fig 6. 


Similarities were noted between the older NMJs 
and those obtained from the denervated muscles. An 
example of an NMJ from a denervated TA muscle is 
shown in Fig 7 to demonstrate the altered receptor 
cluster architecture and apparent axon terminal break- 
down observed in the denervated specimens. The 
similarity in end plate remodeling between the old 
and denervated muscles can be appreciated in Figs 6 
and 7. 

DISCUSSION 


It is estimated that 6 to 15 million elderly Ameri- 
cans suffer from disruptions in swallowing, and the 
number of elderly persons with communicative im- 
pairments may be much larger.33-37 Swallowing dis- 
orders, in particular, have a large impact on general 
health, nutritional status, quality of life, and rehabili- 
tative potential after injury or illness. In aged popu- 
lations, 50% to 68% of nursing home residents are 
estimated to have disruptions in swallowing,*338 with 
associated risks of aspiration pneumonia, hospital- 
ization, and death.39-4! Accordingly, a thorough un- 
derstanding of potential mechanisms underlying dys- 
phagia in the elderly appears critical. 


Despite the major clinical problems of dysphagia 
and communicative dysfunction in the elderly, there 
has been relatively little work providing insight into 
the age-related sequelae of sarcopenia in muscles of 
the upper aerodigestive tract. Although the few exist- 
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Fig 4. Confocal digital images of NMJs from 2 different old TA muscles (original x180). Receptor clusters appear in re ) 
and synaptic vesicles in blue, and Schwann cells and nerve terminals in green. NMJs in old TA muscles were oft 

ized by A) punctate receptor clusters distributed across wide area or B) annular receptor cluster structure. Sca 
micrometers. Colors were adjusted for printing to reflect those observed on computer monitor during data analysi 


ing studies are suggestive of age-related neuromus- 
cular disorganization in laryngeal muscle,?2-23-25.26 
there is a dearth of direct knowledge concerning NMJ 
morphology.*4 Inferences must be made from work 
in limb musculature. However, differences in inner- 
vation structure among limb and laryngeal muscles 
underscore the importance of direct observation of 
NM Js within the larynx. In the TA muscle, for exam- 
ple, innervation ratios are much lower than those in 
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Fig 5. Schwann cell processes, axonal sprouts, and re- 
ceptor cluster degradation were more frequent in old 
muscles, but differences were not statistically significant. 
However, 10.5% of receptor clusters in old muscles were 
unoccupied by axon terminals, whereas nerve terminal 
withdrawal was not observed in any of young muscles (p 
= .007). Error bars represent standard deviations. 


limb muscles, and multiple innervations of single 
muscle fibers from the same axon have been re 
ported.?4-42-44 Interestingly, the frequency of such 
multiple innervations was lower in older muscle spec 
imens.?4 These findings suggest that the biologically 


important, hard-wired neuromuscular redundancy 
found in the TA muscle may be compromised in the 
aged, and may reflect a natural denervation-reinner 
vation process. 

The focus of this investigation was to quantify the 
anatomic manifestations of aging at the NMJ in rat 
TA muscle, as related to the process of natural dener 
vation-reinnervation. Our goal was to identify occur 
rences of presynaptic and postsynaptic modifications 








in aged TA NMJs. In general, we met this goal by 

quantifying significant differences in end plate archi 

tecture between young and old groups of animals 
Neuromuscular junctions were easily identified in 


all muscles under confocal microscopy as widely dis 


tributed across the full length and width of the muscle 
specimen, and not confined to an end plate band or 
clustered together, as is observed in limb muscle. This 
finding has been reported by others and has been at 

tributed to the complex geometry of the TA muscle.4 
In the young muscles, well-defined nerve terminal 
and receptor regions were identified. However, in the 
old muscles, a significant reduction in nerve termi 





nal area was found, in comparison with the young 
group. There was also a significant increase in the 





number of receptor clusters unoccupied by nerve ter 
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Fig 6. NMJ from denervated rat TA muscles demonstrated 
nerve terminal disorganization and receptor cluster an- 
nularity. Nerve terminal sprouts (blue) and terminal 
Schwann cell projections (green) are shown at arrow 
heads. Scale bar is in micrometers. Colors were adjusted 
for printing to reflect those observed on computer moni- 
tor during data analysis. 


minals in the old muscles. These findings are com- 
plementary in that nerve terminal withdrawal likely 
would be preceded by progressive reductions in nerve 
terminal area. In addition, we found increased vari- 
ability in the relationship between terminal area and 
receptor cluster area in the aged muscle. The in- 
creased variability in end plate structure may be re- 
flective of generalized neuromuscular disorganiza- 
tion in the old TA muscles. 


Other observations were suggestive of age-related 
alterations in end plate structure, but did not reach 
statistical significance. These observations included 
increased terminal Schwann cell projections, which 
are precursors to axon terminal sprouting,’ and recep- 
tor cluster degradation in the old muscles. Quite prob- 
ably, the failure to achieve statistical significance was 
due to inadequate statistical power, which could be 
overcome with the inclusion of more animals in each 
group. In this study, the sample size was constrained 
by the availability of old animals from the National 
Institute on Aging contract colonies. Similarly, the 
age of the old animals in this study, 24 months, was 
constrained by availability. Because F1 hybrid ani- 
mals have a potential life span of up to 42 months,’ 
it would be interesting to determine whether a pro- 
gression in NMJ alteration would be observed in ani- 
mals older than 24 months. The question of progres- 
sive changes will be addressed in future studies. 





Fig 7. NMJ from old muscle demonstrates receptor clus- 
ter degradation and terminal Schwann cell projections 
(arrowheads). Scale bar is in micrometers. Colors were 
adjusted for printing to reflect those observed on com- 
puter monitor during data analysis. 


Intact receptor sites and nerve terminals were ob- 
served in the young TA muscles. The young muscles 
tended to have uniform receptor cluster density across 
the muscle, and nerve terminals occupied all recep- 
tor sites. In contrast, increased frequency and sever- 
ity of receptor cluster and nerve terminal remodel- 
ing in the old TA muscles were typically observed. 
These alterations in NMJ morphology mirrored the 
changes observed in the 2 denervated animals, includ- 
ing widespread nerve terminal degeneration, termi- 
nal Schwann cell projections, axonal sprouting, and 
receptor cluster remodeling, with an annular or punc- 
tate appearance of receptor clusters. These results are 
highly suggestive of a natural denervation-reinner- 
vation process at work in peripheral motor synapses 
as animals age. 

In summary, NMJs in laryngeal muscle undergo 
significant alterations as a function of aging that are 
similar to those reported for denervation. As our pop- 
ulation ages, strategies for treating elderly persons 
with dysphagia and communicative impairment will 
be of crucial importance. The results of this study 
increase our understanding of the basic morphologi- 
cal substrates of peripheral age-related changes in 
the larynx, and may serve as a baseline to test the 
effectiveness of particular interventions, such as ca- 
loric restriction*® and exercise. Because of the seri- 
ous and pervasive problems encountered in the aero- 
digestive tracts of elderly individuals, it is of great 
importance to add to the knowledge base in this area. 
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INDUCTION OF ENDOLYMPHATIC HYDROPS BY DIRECTLY INFUSED 
MONOCLONAL ANTIBODY AGAINST TYPE II COLLAGEN CB11 


PEPTIDE 
HIROSHI MATSUOKA, MD, PHD 
OTSU, JAPAN 
SOON SEOG KWON, MD, PHD YOSHIRO YAZAWA, MD, PHD 
SEOUL, KOREA OTSU, JAPAN 
MARCO BARBIERI, MD _ T.J. Yoo, MD, PHD 
GENOA, ITALY MEMPHIS, TENNESSEE 


` Autoimmune ear disease induced by type II collagen has been investigated by Yoo et al. In the present study, we investigated the 
effects of direct chronic infusion of type II collagen—specific monoclonal antibodies into guinea pig cochlea. Type II collagen frag- 
ment CB11 peptide-specific monoclonal antibodies (anti-CB11 Mabs) were infused directly into the scala tympani of guinea pigs 
with an Alzet mini-osmotic pump (anti-CB11 Mab group). As a control, normal mouse serum was infused by the same method (con- 
trol group). To evaluate the auditory function, we recorded brain stem auditory-evoked potentials (BAEPs). In the anti-CB11 Mab 
group, 80% of the animals showed an increased hearing threshold of more than 25 dB at 7 days after infusion. The hearing threshold 
shift observed in the guinea pigs of the control group was minimal (15 dB or less). To detect the structural changes, we performed. 
histopathologic studies using hematoxylin and eosin staining. Inflammatory cell migration was detected mainly in the scala tympani 
of the guinea pigs of both groups. In the anti-CB11 Mab group, endolymphatic hydrops was also observed. The results of this ex- 
periment suggest that type II collagen autoimmunity is responsible for the production of hearing loss and endolymphatic hydrops. 


KEY WORDS — autoimmune ear disease, CB11 peptide-specific monoclonal antibodies, direct chronic infusion, endolym- 
phatic hydrops, type II collagen. 


INTRODUCTION was also another study that did not support type II 

Since we described a type II collagen-induced au- COllagen autoimmunity in rats immunized with bo- 
toimmune ear disease (CIAED) model, many stud- vine type II collagen.* Therefore, it seems of great 
ies have been done to develop animal models of importance that we investigate the effects of directly 
CIAED.!~ Rats immunized with bovine type H colla- infused specific antibodies against type II collagen 
gen developed inner ear lesions such as spiral gan- in the cochlea. Cyanogen bromide peptide 11 (CB11 
glion degeneration and perineural vasculitis.? Rats peptide), which is cleaved from type If collagen by 
also developed otospongiosis-like lesions, salpingi- cyanogen bromide, is thought to play an important 
tis, arthritis, and auricular chondritis.34 In addition, role in collagen-induced arthritis and inner ear dis- 
guinea pigs immunized with bovine type II collagen ease in mice.%-!0 It is not known, however, whether 


developed endolymphatic hydrops.5 Although there this peptide is also important in the production of 
are some discrepancies in the observed findings, these CIAED in guinea pigs. Putting these findings to- 


animal models showed strong evidence that sup- gether,we planned to investigate the effects of direct 
ported the possibility of autoimmunity against type chronic infusion of CB11 peptide-specific monoclo- 
I collagen.! nal antibodies (anti-CB11 Mabs) into guinea pig co- 
Defans ; : chleas. 

Moreover, there is clinical and experimental evi- 
dence of increased antibody activities against type 
II collagen in serum and perilymph in patients or ani- MATERIALS AND METHODS 
mals with autoimmune ear disease. 7 However, there Animals. Fourteen guinea pigs of the Hartley strain 
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(weighing 500 to 600 g) were used. Six animals were 
used for infusion of anti-CB11 Mabs, and 8 were 
used as controls. 


Preparation of Anti-CB11 Mabs. Anti-CB11 Mabs 
were purified from the supernatants of number 6, 10, 
21, and D12G B cell hybridoma clones. These clones 
were grown in RPMI medium 1640 supplemented 
with 10% fetal calf serum, 1 mmol/L sodium pyru- 
vate, 100 ug/mL penicillin, 100 g/mL streptomy- 
cin, 2 mmol/L glutamine, 5 x 10-5 mmol/L B-mercap- 
toethanol, 20 mmol/L Hepes buffer (pH 7.4), and 1x 
nonessential amino acids. The RPMI and nonessen- 
tial amino acids were purchased from Irvine Scien- 
tific (Santa Ana, California), and the B-mercaptoetha- 
nol was from Sigma (St Louis, Missouri). A total of 
1.5 Lof supernatant containing anti-CB11 Mabs was 
purified through protein A Sepharose CL-4B (Phar- 
macia Biotech AB, Uppsala, Sweden). According to 
the manufacturer’s recommendations, the superna- 
tant was passed through a protein A Sepharose col- 
umn equilibrated with 20 mmol/L phosphate buffer 
(pH 7.0) and followed by 0.1 mol/L glycine buffer at 
pH 3.0 as an elution buffer. The eluted immunoglo- 
bulin G (IgG) was neutralized by 1 mol/L Tris hydro- 
chloride buffer (pH 9.0). The Mabs were dialyzed in 
phosphate buffer (pH 7.0) and lyophilized. The anti- 
body concentration was determined by 12% sodium 
dodecyl sulfate—-polyacrylamide gel electrophoresis. 


Osmotic Pump Implant. A fine-tipped cannula was 
made as described by Kingma et al!! for cochlear 
infusion. An Alzet mini-osmotic pump (model 2002, 
ALZA, Palo Alto, California) with a fine-tipped can- 
nula was implanted in each animal according to the 
method described by Brown et al.!? Six guinea pigs 
were given an intraperilymphatic chronic infusion 
(incessant infusion for a long time) of anti-CB11 
Mabs. As a control, normal mouse serum was in- 
fused in 8 guinea pigs by the same method. The anti- 
CB11 Mabs and normal mouse serum were infused 
directly into the scala tympani of the left ear of each 
animal through a fine-tipped cannula for 14 days at 
0.5 L/h. As a control, normal mouse serum was in- 
fused in 8 guinea pigs by the same method. 


Recording of BAEPs. Under anesthesia of keta- 
mine hydrochloride (85 mg/kg) and xylazine hydro- 
chloride (17 mg/kg), the brain stem auditory-evoked 
potentials (BAEPs) were recorded by a Nicolet CA- 
1000 system (Nicolet Biomedical Instruments, Madi- 
son, Wisconsin). Investigation of the BAEPs was 
made with the guinea pig in a prone position in a dark, 
electrically shielded box. The body temperature was 
kept stable at approximately 37°C with a heating pad. 
Auditory stimuli were applied through an earphone 
(MDR-ES15, Sony, Japan) that was placed 2 cm from 


the right (or left) ear canal. The response signals were 
recorded with electrodes connected with a screw 
fixed in the skull. The active electrodes were placed 
in the vertex (connected with a screw) and in the 
right (or left) retrocochlear region and grounded on 
the back. The sound stimulus consisted of 100-ms 
clicks at a repetition rate of 11.4/s. The evoked audi- 
tory potentials were amplified x104, filtered with a 
bandpass between 150 and 1,500 Hz, and averaged 
from a total of 500 click-evoked responses for the 
first 10 ms following stimulation. The stimulus inten- 
sity (decibels peak equivalent sound pressure level) 


was calibrated with a sound level meter (model 800B, 


Larson Davis, North Provo, Utah). The hearing thresh- 
old was determined as the lowest intensity for a vis- 
ible wave I. The measurement of BAEPs of 14 ani- 
mals was performed immediately after pump implan- 
tation (on day 0) and on days 3, 7, 10, and 14 after 
surgery. 


Statistical Analysis. Statistical analyses were per- 
formed by the Mann-Whitney U test. 


Histopathologic Studies. After painless sacrifice 
under deep anesthesia, the temporal bones of 11 ani- 
mals were removed after the observation period of 
14 days for histopathologic studies. The temporal 
bones were fixed in 10% neutralized formaldehyde 
buffer solution, and then they were decalcified in 10% 
ethylenediaminetetraacetic acid, dehydrated in alco- 
hol, and embedded in melted paraffin. The paraffin 
blocks were sectioned at 6 um and stained with he- 
matoxylin and eosin. The sections of all 11 animals 
were examined with a photomicroscope, and immu- 
nohistologic examinations were done with anti-mouse 
IgG. 


RESULTS 


Observation of Middle Ear. After the temporal 
bones were removed, the left bulla of each animal 
was exposed and examined to check for the pres- 
ence of middle ear infection. On examination, 1 guin- 
ea pig of the anti-CB11 Mab group and 2 guinea pigs 
of the control group had middle ear infection, and 
they were excluded from this experiment. The result 
was that 5 animals in the anti-CB11 Mab group and 
6 in the control group went on to the final evalua- 
tion. 


Hearing Thresholds. In the anti-CB11 Mab group, 
4 of the 5 guinea pigs showed increased hearing 
thresholds of 5 to 15 dB 3 days after implantation. 
Therefore, the mean hearing threshold had deterio- 
rated 3 days after implantation in the anti-CB11 Mab 
group (Fig 1). Seven days after implantation, all 5 of 
the animals showed increased hearing thresholds of 
more than 15 dB, and the shift of the mean hearing 
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Fig 1. Changes in hearing thresholds of CB11 peptide— 
specific monoclonal antibody (anti-CB11 Mab) group 
(squares). Mean hearing thresholds deteriorated 3 days 
after implantation in anti-CB11 Mab group. At 7 days 
after implantation, all 5 animals showed increased hear- 
ing thresholds of more than 15 dB, and shift of mean 
hearing threshold approximated more than 20 dB at this 
time. These changes continued until end of study. Circles 
— control group; pe — peak equivalent. 


threshold approximated more than 20 dB at this time. 
These changes continued until the end of the study. 


The hearing threshold shifts observed in the 6 guin- 
ea pigs in the control group were minimal (15 dB or 
less). No obvious change in the mean hearing thresh- 
old was noted in the control group during the follow- 
up period of 14 days (Fig 2). 


Fig 3. Cochlear morphology. A) 
Mild endolymphatic hydrops and 
pronounced inflammatory cell 
migration in scala tympani are ob- 
served in guinea pig of anti-CB11 
Mab group (original x30). B) In 
guinea pig of control group, mild 
inflammatory cell migration is ob- 
served, but endolymphatic hy- 
drops is not observed (original 
x40). 
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Fig 2. Changes in hearing threshold of control group. No 
obvious change in mean hearing threshold was noted on 
either side in control group during observation period. 
Squares — experimental ear; circles — contralateral ear. 


By the Mann-Whitney U test, the difference in the 
means of the hearing threshold of the experimental 
ear between these two groups was significant at 7, 
10, and 14 days after implantation. 


On the other hand, there was no obvious change 
in the mean hearing threshold of the contralateral side 
in the two groups (Figs | and 2). In addition, the 
mean hearing thresholds of the contralateral ears of 
the two groups were very similar. 


Morphological Changes. All 5 animals of the anti- 
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CB 11 Mab group developed endolymphatic hydrops. 
One guinea pig (animal 6) of the anti-CB11 Mab 
group showed moderate endolymphatic hydrops.!3 
This animal also demonstrated marked inflammatory 
cell migration in the scala tympani in all of the co- 
chlear turns and slight cell migration in the scala ves- 
tibuli and scala media. The others (animals 1, 3, 7, 
and 8) in the anti-CB11 Mab group showed slight 
endolymphatic hydrops!3 (Fig 3A). These animals 
also had mild inflammatory cell migration only in 
the scala tympani (Fig 3A). Inflammatory cell mi- 
gration was observed mainly in the vicinity of the 
basilar membrane and spiral ligament in the scala 
tympani and in the neighborhood of the spiral liga- 
ment and Reissner’s membrane in the scala vestibuli. 
Vacuolization of the cells of the stria vascularis and 
cochlear vasculitis were seen sporadically. On the 
other hand, the sections of the control group showed 
mild inflammatory cell migration in the scala tympani 
and occasionally in the scala vestibuli. There was no 
endolymphatic hydrops in the guinea pigs of the con- 
trol group (Fig 3B). They showed inflammatory cell 
migration in the lumen of the endolymphatic sac. 





Fig 4. Endolymphatic sac (original x50). Inflam- 
matory cell infiltration is observed in lumen of en- 
dolymphatic sac in guinea pigs of A) anti-CB11 
Mab group and B) control group. 





Inflammatory cell infiltration was also observed 
in the lumen of the endolymphatic sac in the anti- 
CB11 Mab group (Fig 4A) and the control group (Fig 
4B). 

DISCUSSION 


An autoimmune response to the components of 
connective tissue, in particular type II collagen, has 
long been recognized as important in arthritis, but it 
has also been proposed as important in human ear 
disease.° Type II collagen is known to be the major 
component of cartilage, but immunohistochemical 
studies of ear tissue in rodents have revealed its dis- 
tribution in the endochondral layer and globuli inter- 
ossei of the otic capsule, osseous spiral lamina, spi- 
ral ligament, limbus, tectorial membrane, endolym- 
phatic duct, and other tissue structures. !+!8 


The involvement of ear structures in autoimmune 
responses to type II collagen has been investigated 
in several related rodent models. Type H collagen— 
induced ear lesions in Lewis rats and guinea pigs 
were first reported by Yoo et al in 198325 and 1984.7 
However, Harris et al,® in 1986, did not observe these 
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phenomena. Soliman!? reexamined this controversi- 
al issue in 1990 and demonstrated impaired hearing 
thresholds and morphological changes, endolymphat- 
ic hydrops, slight vasculitis of the cochlear vessels, 
and slight degeneration of the spiral ganglion cells. 
Huang et al?! reproduced type II collagen-induced 
bone resorption in the temporal bone and salpingitis 
in rats, thus confirming our previous observations in 
contrast to the negative report by Bretlau et al.?? Cruz 
et al? also reproduced spiral ganglion degeneration 
in guinea pigs with type II collagen immunization. 
These animals had an increase of latency in wave I. 
Tomoda et al?4?5 reproduced type II CIAED in guinea 
pigs in separate experiments. The results of this study 
further confirm the original observations of Yoo et 
al. Soliman, however, pointed out that there was 
quantitative and qualitative variation in his study us- 
ing type II collagen—immunized animals. Therefore, 
it seems of great importance that we investigate the 
effects of directly infused specific antibodies against 
type II collagen in the cochlea. In this study, then, 
we aimed to examine the effects of anti-CB11 Mabs 
in the guinea pig cochlea. 


In this way, we were able to induce endolymphatic 
hydrops in nearly 100% of the animals in the anti- 
CB11 Mab group. There may be some arguments 
about the results because of the small number of ani- 
mals evaluated in this study; even so, we thought 
they were meaningful. In addition, all guinea pigs in 
the anti-CB11 Mab group developed increased hear- 
ing thresholds. These threshold shifts seem to be to 
the same degree reported by Soliman.!? On the other 
hand, endolymphatic hydrops was not noted in the 
control group, and the hearing threshold shifts ob- 
served in these animals were minimal. These results 
support the appearance of endolymphatic hydrops 
due to immunologic reactions induced by anti-CB 11 
Mabs in the cochlea in the guinea pig model. 


It is also suggested that direct injury of the inner 
ear by anti-CB11 Mabs causes increased hearing 
thresholds. Therefore, the results of this study sug- 
gest that type II collagen autoimmunity is responsi- 
ble for the production of hearing loss associated with 
endolymphatic hydrops in the guinea pig animal mod- 
el. 
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IMMUNOHISTOCHEMICAL LOCALIZATION OF SUBTYPES OF 
MUSCARINIC RECEPTORS IN HUMAN INFERIOR TURBINATE 
MUCOSA 
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The regulation of glandular secretions and vasomotor tone in human nasal mucosa implicates muscarinic receptors. There are 5 
recognized classes (m1 through m5) of muscarinic receptor subtypes, and the aim of our study was to localize muscarinic receptor 
subtypes (m1 through m5) in human inferior turbinate mucosa by an immunohistochemical method. We found m1 and m2 receptors 
distributed on glands, arteries, veins, and epithelia; m4 receptors were found around arteries; and m5 receptors were identified on 
glands and arteries. We found m3 receptors to be the most extensively distributed on glands, arteries, and veins of all of the musca- 
rinic receptor subtypes. The m3 receptor is probably important in the physiology of the human inferior turbinate. This study may help 
identify the best target for more selective muscarinic drugs and guide the treatment of allergic and nonallergic rhinitis. 


KEY WORDS — cholinergic innervation, human nasal mucosa, inferior turbinate mucosa, muscarinic receptor, receptor sub- 


type. 
INTRODUCTION 


The parasympathetic nervous system regulates 
glandular secretion! and vasomotor tone’? in the 
nasal mucosa, especially in the inferior turbinate. Na- 
sal challenge with cholinergic agonists such as metha- 
choline produces hypersecretion in normal, allergic, 
and nonallergic rhinitis subjects,!0-!? and this reac- 
tion is inhibited by anticholinergic drugs such as at- 
ropine and ipratropium. These findings indicate that 
muscarinic receptors mediate glandular secretion. 


Muscarinic receptors are cell surface receptors that 


participate in a wide variety of physiological actions. 


There are 4 pharmacologically recognized classes 


(M1 through M4) of muscarinic receptor subtypes,!>_ 


and 5 muscarinic receptor genes (m1 through-m5) 
have been identified.'4 These subtypes differ in their 
tissue distributions, agonist- and antagonist-binding 
affinities, structures, and functions. The m1 and m3 
receptor subtypes were characterized pharmacologi- 
cally in human inferior turbinate by radioligand bind- 
ing assays,!>-!6 and localization of only m3 receptor 
messenger RNA in human inferior turbinate was iden- 
tified by use of in situ hybridization.'’ But the im- 
munologic localization of muscarinic receptor sub- 
types in the human inferior turbinate has been un- 
known. 


The object of our study was to identify the distri- 
bution of muscarinic receptor subtypes (m1, m2, m3, 
m4, and m5) in the human inferior turbinate by an 
immune structure dyeing method. 


MATERIALS AND METHODS 


Inferior Turbinate Mucosa Samples. Inferior tur- 
binate specimens were received from 10 patients (8 
men, 2 women; 20 to 53 years old; mean age, 31.5 
years) who required surgery for nasal obstruction (hy- 
pertrophic concha or septal deviation). Tissue sam- 
ples for histochemical analysis were fixed in 4% para- 
formaldehyde in 0.1 mol/L phosphate-buffered sa- 
line solution (PBS) for 6 hours at 4°C, cryoprotected 
in sucrose solution for 48 hours, and stored at —-80°C 
until processing for immunostaining. 


Immunocytochemistry. The frozen sucrose-embed- 
ded specimens were cut with a cryostat into sections 
of 10-um thickness at —24°C and mounted on gela- 
tin-coated slides. The cryosections were air-dried for 
30 minutes. In order to suppress the activity of en- 
dogenbus peroxidase of red blood cells and other in- 
flammatory cells, all sections were pretreated with a 
solution containing 4N hydrochloric acid and 100% 


SEMIQUANTITATIVE SCORES FOR STAINING OF 
MUSCARINIC RECEPTOR SUBTYPES IN HUMAN 
INFERIOR TURBINATE MUCOSA 











Turbinate Site ml m2 m3 m4 mS 
Epithelium + + + 0 0 
Glands -+ + +++ 0 

Arterioles ++ + +++ + + 
Veins and veinlets ++ + ++ 0 0 
Cavernous sinuses + 0 + 0 0 


0 — negative; + — recognizably positive; ++ — clearly positive; 
+++ — excessively positive. 
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Fig 1. Nerve fibers immunoreactive for m1 receptors in human inferior turbinate mucosa (arrows). A) Vein (v). B) Arteries (a). 


C) Glands (g). D) Epithelium (e). Bar — 50 um. 


methanol. Serial sections were incubated with rab- 
bit polyclonal antibodies (Research and Diagnostic 
Antibodies, Berkeley, California) for m1 (dilution 
1:2,000), m2 (dilution 1:2,000), m3 (dilution 1:2,000), 
m4 (dilution 1:500), and m5 (dilution 1:500) for 24 
hours at 4°C. Controls consisted of omission of the 
primary antibody in the staining procedure and incu- 
bation of serial sections with rabbit immunoglobu- 
lin G (Vector Labs, Burlingame, California) diluted 
to 1:500. The sections were treated with avidin-bio- 
tinylated horseradish peroxidase complex reagents 
(Vector Labs) and then stained with 0.001% 3-,3'di- 
amino-benzidine with 0.03% hydrogen peroxide. Fi- 
nally, all sections were mounted with Kaiser’s glyc- 
erin gelatin, evaluated, and documented by light mi- 
croscopy. 

Semiquantitative Analysis. The findings in all ex- 
perimental groups, each containing the previously 
noted morphological structures with recognizable 
positive staining, were subjected to semiquantitative 
scoring. The values used were 0 points for negative, 
l point (+) for slightly positive, 2 points (++) for 
clearly positive, and 3 points (+++) for extremely 
positive staining. All data were averaged and rounded. 


RESULTS 


The Table presents the localization of muscarinic 
receptor subtypes (m1 through m5) in human infe- 
rior turbinate. Nerve fibers immunoreactive for m1, 
m2, and m3 were observed around veins (Figs 1A, 
2A, and 3A). Nerve fibers immunoreactive for m4 
and m5 could not be detected around veins. The m3- 
immunoreactive nerve fibers were most excessively 
positive around venous vessels, and m1- and m3-im- 
munoreactive nerve fibers were slightly distributed 
in the cavernous sinuses. Immunoreactive nerve fi- 
bers for all muscarinic receptor subtypes (m1 through 
m5) were found around arterioles (Figs 1B, 2B, 3B, 
4, and 5B), with copious m3-immunoreactive nerve 
fibers around arterioles. The nasal glands were inner- 
vated by ml-, m2-, m3-, and m5-immunoreactive 
nerve fibers (Figs 1C, 2C, 3C, and 5A). No m4-im- 
munoreactive nerve fibers were found in the nasal 
glands, and m3-immunoreactive nerve fibers were 
most richly distributed in the nasal glands. In the epi- 
thelium, there were small numbers of m1-, m2-, and 
m3-immunoreactive nerve fibers (Figs 1D, 2D, and 
3D). Nerve bundles of various sizes immunoreactive 
for ml, m2, and m3 were seen in many parts of the 
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Fig 2. Nerve fibers immunoreactive for m2 receptors in human inferior turbinate mucosa (arrows). A) Vein (v). B) Artery (a). 
C) Glands (g). D) Epithelium (e). Bar — 50 um. 





Fig 3. Nerve fibers immunoreactive for m3 receptors in human inferior turbinate mucosa (arrows). A) Vein (v). B) Artery (a). 
C) Glands (g). D) Epithelium (e). Bar — 50 um. 
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Fig 4. Nerve fibers immunoreactive for m4 receptors in 
human inferior turbinate mucosa (arrows). Bar — 50 um; 
a — artery. 


mucosa (data not shown), and m4- and m5-immu- 
noreactive nerve bundles were not found in the inter- 
stitium. 

DISCUSSION 


Our study demonstrated localization of muscarin- 
ic receptor subtypes in human inferior turbinate mu- 
cosa by an immunohistochemical method. Receptors 
for m1, m2, and m3 are distributed on arteries, veins, 
submucosal glands, and epithelia. Ishii? described the 
cholinergic innervation of human inferior turbinates 
in glands, arteries, and veins by histochemical distri- 
bution of cholinesterase activity. The immunohisto- 
chemical distributions of muscarinic receptors were 
proportionate to the distribution of cholinesterase ac- 
tivity on glands and blood vessels. In our study, we 
found a sparse presence of m1, m2, and m3 receptors 
in the epithelium, and a slight presence of both m1 
and m3 in the sinusoids. Autoradiograms indicated 
pharmacologically that M1 and M3 receptors are dis- 
tributed densely on submucosal glands, arteries, and 
veins, and are distributed moderately to sparsely on 
sinusoids and epithelial cells,'> although according 


to Ishii, the epithelial layer failed to show cholines- 
terase activity. Cholinesterase is only somewhat spe- 
cific for cholinergic nerves and thus is not necessar- 
ily a precise indicator of cholinergic innervation. The 
parasympathetic nervous system is believed to play 
an essential role in the airway defense system. Mus- 
carinic receptors are involved in modulation of cili- 
ary action in the respiratory system.!* The selective 
m1 receptor antagonist pirenzepine dihydrochloride 
and the selective m3 receptor antagonist 4-diphenyl- 
acetoxy-N-methylpiperidine methiodide significantly 
blocked methacholine-induced ciliostimulation; this 
finding indicates that ml and m3 receptors are in- 
volved in the transduction of the muscarinic recep- 
tor signal responsible for ciliary stimulation. !° How- 
ever, there was no significant inhibition of metha- 
choline-induced ciliary stimulation by the selective 
m2 receptor antagonist gallamine — a finding dem- 
onstrating that m2 receptors are not involved in stim- 
ulation of respiratory cilia by methacholine.!? The 
function of muscarinic receptors in the epithelium 
may be modulation of ciliary action in nasal mucosa. 
We found a sparse presence of m2 receptors around 
glands, arteries, and veins. However, Okayama et al!5 
stated that M2 receptors did not appear in the human 
inferior turbinate on pharmacological study. This dis- 
parity in results may have occurred because an im- 
munohistochemical approach is more precise than a 
pharmacological one in the localization of receptors. 
Our study localized m4 and m5 receptors in human 
inferior turbinates: m4 receptors were found around 
arteries, and m5 receptors were distributed on glands 
and arteries. The exact function of m4 and m5 re- 
ceptors in the human inferior turbinates needs to be 
elucidated. 


The m3 receptors were most extensively distrib- 
uted on glands, arteries, and veins in all muscarinic 
receptor subtypes. The m3 receptor is probably im- 





Fig 5. Nerve fibers immunoreactive for m5 receptors in human inferior turbinate mucosa (arrows). Bars — 50 um. A) Glands 
(g). B) Artery (a). 
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portant in the physiology of the human inferior tur- 
binate. This study may help identify the best targets 


for more selective muscarinic drugs and guide the 
treatment of allergic and nonallergic rhinitis. 
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OTOLOGIC FACIAL PALSY: ETIOLOGY, ONSET, AND SYMPTOM 
DURATION 


ELLEN KVESTAD, MD 
KARI J. KVÆRNER, MD, PHD TAIN W. S. MAIR, MD, PHD 
OSLO, NORWAY 


To estimate the occurrence of otogenic facial palsy, we performed a retrospective case record study of all patients hospitalized 
for otogenic facial palsy in the period 1989 to 1999 at Ullevål University Hospital, which is the only referral hospital for patients with 
otologic sequelae in Oslo. The facial palsy was a complication of acute otitis media in 10 patients (56%), of acute mastoiditis in 3 
patients (17%), of secretory otitis media in 3 patients (17%), and of chronic otitis media in 2 patients (11%). In half of the patients, 
complete facial palsy was found at the time of diagnosis. Sixteen patients (89%) reported a gradual onset of the facial palsy. The 
mean duration of otologic symptoms before the onset of facial palsy was 3 days (range, 1 to 9 days), and the median time to remission 
was 9 weeks (range, 2 to 96 weeks). Total remission was achieved in all patients who received follow-up. Although most patients 
recover within a few weeks, some patients have long-lasting facial palsy. Multicenter studies are needed to increase the sample size 
and to identify predictors of facial palsy duration. 


KEY WORDS — facial palsy, mastoiditis, otitis media. 


INTRODUCTION Most clinical studies have focused on the relation- 
ship between medical and surgical treatment and the 
subsequent duration of the facial palsy. Because of 
the low prevalence of facial palsy, most materials 
have been too small for us to draw significant con- 
clusions. Further, cases of otogenic facial palsy due 
to acute otitis media in children*7,° and cases due to 
chronic otitis media in adults are normally pre- 
sented separately. Because Ullevål University Hos- 
pital is the only referral hospital for otologic com- 


The various clinical manifestations of otitis me- 
dia are considered to be different stages of the same 
disease,!2 and facial palsy is a possible complica- 
tion in all stages. Because of the reduction in intra- 
temporal and intracerebral complications after the 
introduction of antibiotics, otogenic facial palsy is 
at present rare,?> being reported in less than 1% of 
patients with otitis media.>- 


Although the mechanism of facial palsy develop- plications in Oslo, the incidence of facial palsy can 
ment is not known, 3 pathophysiological theories be estimated. The present study, including all patients 
have been postulated: 1) a middle ear infection theory, hospitalized for facial palsy during an 11-year peri- 
in which the palsy is caused by osteitis and neuritis od, describes the diagnosis and treatment of otogen- 
and subsequent nerve entrapment, 2) a bacterial toxin ic facial palsy. It particularly focuses on causation 
theory, in which the palsy is caused by demyeliniza- and the time sequence between the onset of otologic 
tion and impaired nerve conduction, and 3) an inflam- symptoms and the presentation and duration of fa- 
mation theory in which the palsy is caused by throm- cial palsy. Finally, predictors for prolonged duration 
bosis of the vasa nervorum and subsequent ischemia. of facial palsy are discussed. 

Consequently, treatment recommendations have fo- 
cused on both antibiotic treatment and surgery, in- MATERIALS AND METHODS 


cluding myringotomy, mastoidectomy, and nerve de- 
compression,” to reestablish the physiological state 
of the facial nerve. 


The study population included all patients with 
otologic facial palsy referred to Ullevål University 
Hospital in Oslo, a city of approximately 550,000 
Clinical studies have been performed in an effort inhabitants, during the period 1989 to 1999. The case 


to find predictors for early facial palsy remission. It records of 18 patients were analyzed retrospectively 
has been claimed that the duration of facial palsy is for descriptive and predictive factors for facial palsy 
prolonged in children with complete facial palsy,’ duration. The data on age cohort, gender, age, otolog- 
with an increased probability of permanent sequelae.° ic symptoms, and hematologic findings were regis- 
The proportion of patients with permanent sequelae tered from the case sheets. The inclusion criterion for 
varies from 0% to 30%.3.7-9 the study was subtotal or total facial palsy of otolog- 
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Fig 1. Age distribution of Oslo children with otogenic 
-facial palsy from 1989 to 1999 by gender (N = 18). 


ic origin. The presence of fever (>38°C), otalgia, otor- 
rhea, and otologic and facial nerve function was reg- 
istered at the time of hospitalization. The clinical di- 
agnosis of facial palsy was based on the graded func- 
tion of the musculus frontalis, musculus orbicularis 
oculi, and musculus orbicularis oris. Additionally, in- 
formation on increased or decreased tear flow and 
changes in taste or salivation was collected. Addition- 
al factors in the study were the otologic cause of the 
facial palsy and the palsy’s characteristics and dura- 
tion. An acute onset of facial palsy was defined as 
a symptom onset and progression within the first 
24 hours, and a subacute onset of facial palsy was 
defined as a symptom progression after 24 hours or 
more. The surgical and medical treatment, the du- 
ration of the facial palsy, and the sequelae were reg- 
istered. The total duration of the facial palsy was de- 
fined as the total time from symptom onset to max- 


imum recovery of facial palsy. All treatment as- 
sessments were performed by one of the authors 
(I.W.S.M.). 


Proportions were used to describe the study popu- 
lation by cause and duration of facial palsy. The me- 
dian was used to assess age at the time of diagnosis, 
average symptom duration prior to diagnosis, and to- 
tal duration of facial palsy. Because Ullevål Univer- 
sity Hospital is the referral hospital for patients with 
facial palsy and patients with otologic sequelae tra- 
ditionally treated as inpatients in public hospitals in 
the Norwegian health system, the incidence of com- 
plications can be estimated. Further, we used a sub- 
sample of children who presented with facial palsy 
to estimate the complication rate of facial palsy in 
acute otitis media by 2 years of age. As the denomi- 
nator, the incidence of acute otitis media by 2 years 
of age estimated in a prospective cohort of Oslo chil- 
dren (41.8%) was used.!° For this part of the study, 
only children 2 years old or younger were included. 


RESULTS 


Disease Distribution. In an 11-year period, 18 pa- 
tients presented with facial palsy of otologic cause 
— an annual incidence of 0.34 per 100,000 inhabi- 
tants. Seven children presented with facial palsy by 
2 years of age. Based on an acute otitis media inci- 
dence of 41.8% by 2 years of age,!° the complica- 
tion rate of facial palsy was 0.55%. The age distribu- 
tion of the 13 male patients and 5 female patients in 
the study is shown in Fig 1. There was a bimodal age 


TABLE 1. CHARACTERISTICS OF 18 PATIENTS HOSPITALIZED FOR OTOGENIC FACIAL PALSY IN 1989 TO 1999 BY 


























CAUSE OF PALSY 
Acute Secretory Chronic 
Acute Otitis Mastoiditis Otitis Media Otitis Media All 
Media (n = 10) (n= 3) (n = 3) (n= 2) (N = 18) 

Cohort 1989 0 0 0 0 0 
1990 2 0 0 0 2 

1991 1 1 1 0 3 

1992 0 0 0 0 0 

1993 3 2 1 1 7 

1994 2 0 0 0 2 

1995 1 0 0 0 1 

1996 1 0 1 1 3 

1997 0 0 0 0 0 

1998 0 0 0 0 0 

1999 0 0 0 0 0 

Age Children (<16 y) 8 2 3 0 13 
Adults 2 1 0 2 5 

Sex Male 7 2 3 1 13 
Female 3 1 0 1 5 

Facial palsy Complete 6 1 0 2 9 
Incomplete 4 2 3 0 9 

Facial palsy onset Acute 2 0 0 0 2 
Subacute 8 3 3 2 16 
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TABLE 2. SYMPTOM DISTRIBUTION, DIAGNOSTIC FINDINGS, AND TREATMENT OF FACIAL PALSY BY 
CAUSE OF PALSY 
Acute Secretory Chronic 
Otitis Acute Otitis Otitis 
Media Mastoiditis Media Media All 
(n=10) (n=3) (n=3) (n=2) (N= 18) 
Symptoms Fever Yes 5 1 3 0 9 
No 5 2 0 2 9 
Otalgia Yes 5 2 2 1 10 
; No 5 1 1 1 8 
Otorrhea Yes 4 2 0 1 7 
No 6 1 3 1 11 
Subjective hearing loss Yes 4 3 0 2 9 
No 6 0 3 0 9 
Increased tear flow Yes 4 0 1 2 7 
No 6 3 2 0 11 
Decreased tear flow Yes 1 0 0 0 1 
No 9 3 3 2 17 
Changed taste or salivation Yes 0 0 0 0 0 
No 10 3 3 2 18 
Otoscopic findings Tympanic membrane on Abnormal 8 3 3 2 16 
affected side Normal 2 0 0 0 2 
Tympanic membrane in Abnormal 4 0 1 0 5 
contralateral ear Normal 6 3 2 2 13 
Hematologic findings C-reactive protein <20 mg/mL 4 1 2 2 9 
>20 mg/mL 4 2 0 0 6 
White blood cell count <10 x 10°/L 2 0 2 2 6 
(leukocytes) >10 x 10°/L 6 2 1 0 9 
Facial palsy Complete 6 1 0 2 9 
Incomplete 4 2 3 0 9 
Medical treatment Intravenous antibiotics Benzylpenicillin 4 1 0 0 5 
Ampicillin 1 0 0 0 1 
Amoxicillin 2 0 0 0 2 
Cefuroxime 2 2 1 1 6 
Erythromycin 1 0 0 0 1 
None 0 1 2 1 4 
Surgical treatment Myringotomy Yes 7 l 2 0 10 
No, perforated 3 2 0 1 6 
No, not performed 0 0 1 1 2 
Mastoidectomy Yes 0 2 0 1 3 
No 10 i 3 1 15 





distribution, with median ages of 2 years and 45 years 
among the children and adults, respectively. The char- 
acteristics of facial palsy by cause are shown in Table 
i, 


Clinical Course. The otologic diagnosis was based 
on history, presenting symptoms, and otoscopic, he- 
matologic, and surgical findings. Fever, otalgia, otor- 
rhea, hearing loss, and increased tear flow were pre- 
senting symptoms in approximately half of the study 
population; only 1 patient experienced decreased tear 
flow (Table 2). At examination, 16 patients presented 
with rubor, retraction, bulging, or perforation of the 
tympanic membrane in the affected ear, and 5 in the 
contralateral ear. Elevated white blood cell counts 
and C-reactive protein levels were registered in 9 and 
6 patients, respectively. Computed tomographic scan- 
ning of the temporal bone was performed in 8 pa- 





tients and showed normal findings in 1 patient (with 
chronic otitis media), bone resorption compatible 
with cholesteatoma in 1 patient, and opacification of 
the middle ear and mastoid cells in 6 patients. The 
distributions of symptoms, diagnoses, and treatments 
by cause are shown in Table 2. 


Diagnostic myringotomy was performed in all pa- 
tients with acute otitis media or acute mastoiditis who 
presented without otorrhea. Before hospital admis- 
sion, 16 patients had received antibiotic treatment. 
During the hospital stay, all patients but 4 received 
intravenous antibiotics. During hospitalization, 6 
were given cefuroxime sodium, 5 phenoxymethyl 
penicillin, 2 amoxicillin, 1 ampicillin sodium, and 1 
erythromycin ethylsuccinate. The median hospital 
stay was 4 days. Mastoidectomy was performed in 3 
patients: 2 patients with acute mastoiditis and 1 pa- 
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Acute otitis media 


Fig 2. Distribution of ear symptom duration Secretory otitis media 


before onset of facial palsy by otogenic cause 
of palsy. 


Acute mastoiditis 


Chronic otitis media 


tient with chronic otitis media and cholesteatoma. 


Onset, Duration, and Sequelae of Facial Palsy. > 
Fifty percent of the patients presented with complete 
facial palsy (Table 1). The 2 patients with acute on- 
set of facial palsy recovered completely within 2 and 
4 weeks (Table 1). The onset of facial palsy in the 
remaining 16 patients was subacute. The mean time 
between the onset of otologic symptoms and the fa- 
cial palsy was 3 days, and the range was 1 to 8 days 
(Fig 2). In all patients but 1, the facial palsy was 
preceded by otitis media. 


The mean duration of facial palsy was 9 weeks 
(range, 2 to 96 weeks). Because all patients recov- 
ered completely from the facial palsy, the observa- 
tion time was identical to the facial palsy duration. 
Rapid remission of facial palsy (8 weeks or less) was 
more common in children than in adults (Table 2). 
Also, prolonged duration of facial palsy was more 
common in male patients. Figure 3 shows that al- 


- though most patients recovered within a few weeks, 


some patients in each etiologic category experiencd 
a prolonged duration. Among patients with acute oti- 
tis media, 7 of 10 had a symptom duration of longer 
than 8 weeks. Two of the 3 patients with secretory 


-otitis media and 1 of the 2 patients with chronic oti- 


tis media recovered within 8 weeks or less. Neither 
the completeness of the facial palsy nor the length of 
the otologic symptoms prior to the onset of facial 
palsy had any impact on the subsequent duration of 
the facial palsy. The distribution of facial palsy cases 
by symptom duration is shown in Table 3. 


The 3 patients with a facial palsy duration of 30 
weeks or more were male (Fig 3); 2 had complete 
facial palsy at the time of diagnosis, and all recov- 
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ered without sequelae. Two had acute otitis media, 
and { had acute mastoiditis. All had received antibiot- 
ics before hospitalization, presented with normal he- 
matologic findings, and received parenteral cefurox- 
ime sodium. 


DISCUSSION 


In our study, we found that facial palsy is a rare 
complication of otitis media that more commonly af- 
fects male children with acute otitis media. Most pa- 
tients recover within a few weeks without sequelae. 
The duration of facial paralysis could not be predicted 
by the characteristics of the facial palsy or by the 
otologic or hematologic findings on presentation. 


The annual incidence rate was 0.34 per 100,000, 
which is somewhat lower than the 0.50 per 100,000 
found in a Danish study? that only comprised facial 
palsy as a complication of acute otitis media. Fur- 
ther, there was an increased probability of facial palsy 
in young children, particularly boys.!! It has been 
suggested that the causal factor is immaturity of the 
immune system and subsequent vulnerability to in- 
fections.!! In particular, boys are at higher risk for 
being otitis-prone.!° However, the age-specific inci- 
dence of otitis media may explain the high propor- 
tion of facial palsy in small children. Age-specific 
incidence estimates of acute otitis media in children 
by 2 years of age showed that the facial palsy com- 
plication rate in young children was only 0.54%. A 
corresponding figure of 0.40% in children younger 
than 16 years was reported in a Danish study.3 


The introduction of antibiotics has changed the 
outlook on the prognosis and treatment of otitis me- 
dia, including complications and sequelae.”.?:!2 Be- 








Acute otitis media 


Secretory otitis media 


Acute mastoiditis 


Chronic otitis media 


8 
è 
fe 
8 


Fig 3. Distribution of duration of facial palsy 
by otogenic cause. 


= 
8 
— 
N 
o 





602 Kvestad et al, Otogenic Facial Palsy 


TABLE 3. PROLONGED DURATION OF FACIAL PALSY 
IN PATIENTS HOSPITALIZED IN 1989 TO 1999 BY 
DIAGNOSIS AND DEGREE 


Facial Facial 

Palsy Palsy 
Duration Duration 

SB wk >8 wk All 
(n=10) {n=8) (N= 18) 





Age Children (<16 y) 9 4 13 

Adults 1 4 5 

Sex Male 5 8 13 

Female 5 0 5 

Diagnosis Acute otitis media 6 4 10 

Acute mastoiditis 1 2 3 

Secretory otitis media 2 1 3 

Chronic otitis media 1 1 2 

Facial Complete 5 4 9 

palsy Incomplete 5 4 9 

Facial Acute 3 0 3 

palsy Subacute 7 8 15 
onset 


cause of the low frequency of facial palsy, large clini- 
cal studies are difficult to perform.” A problem with 
limited sets of patients and inadequate follow-up is 
that the available information is too limited to draw 
inferences from the study results. In contrast to other 
studies that focused on facial palsy as a complica- 
tion of either acute or chronic otitis media,*+®!3 the 
present study comprises information on all patients 
with otogenic palsy within an 11-year period. For 
instance, it has been claimed that myringotomy, mas- 
toidectomy, and subsequent nerve decompression are 


necessary in patients with a duration of facial palsy 
of 2 weeks or more.!! In our study, all patients who 
were followed up recovered completely, although 
mastoidectomy was performed in only 3 patients, and 
nerve decompression was not performed in any. 


Most likely, the varying practice of medical and 
surgical treatment of these patients*6!! is due to the 
lack of predictors for facial palsy remission. Earlier 
studies have reported that complete palsy on presen- 
tation is a poor prognostic factor,’ in particular when 
decreased tearing is present. Further, the rates of se- 
quelae reported vary from 0% to 30%.35.9 In our study, 
characteristics of facial palsy presentation, such as 
completeness, acute or delayed onset, or duration of 
ear symptoms prior to paralysis, did not affect the 
duration rate or the remission rate. 


Although the majority of patients recovered within 
a few weeks, predictors for prolonged duration of 
palsy were not found. Because of the low incidence 
of disease complications, multicenter studies are 
needed to increase the sample size and identify pre- 
dictors for facial palsy duration. 


CONCLUSION 


Otologic facial palsy is rare, and occurs in the ma- 
jority of cases in male children as a complication of 
acute otitis media. All patients in this study recov- 
ered quickly and without sequelae. The degree of 
palsy and the duration of symptoms prior to the oti- 
tis did not predict the duration of paralysis. 
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DIGITAL PHOTOGRAPHY OF THE LARYNX 


BRUCE BENJAMIN, FRACS 
SYDNEY, AUSTRALIA 


Digital photographic technology has ushered in a new era in medical documentation. The author’s method of still digital photog- 
raphy for direct and indirect laryngeal photodocumentation using a single-lens-reflex camera and rigid rod-lens telescopes is de- 
scribed, and the results are presented. The advantages and disadvantages of different systems using still or video photography are 
discussed, and the importance of storing the archival image securely is emphasized. It is concluded that digital photography is 
convenient and versatile and gives good-quality images. It is the way of the future. 


KEY WORDS — digital photography, larynx. 


There have been many milestones in the develop- 
ment of photography of the larynx. French,! of New 
York, using sunlight reflected from a head mirror, is 
credited with taking the earliest black-and-white pho- 
tographs; his paper “On a Perfected Method of Pho- 
tographing the Larynx” was published in 1884. In 
1919, Garel? described a method for taking stereo- 
scopic pictures of the larynx, and by 1925, Clerf? had 
reported his method of photography. Beginning in 
the 1940s, Paul Holinger*} set the standard for accu- 
rate, precise documentation using the ingenious, pur- 
pose-constructed Holinger-Brubaker camera, which 
by modern standards was awkward and heavy. In 
1968, Kleinsasser® published his landmark textbook 
“Microlaryngoscopy and Endolaryngeal Microsur- 
gery,” containing color photographs taken with a pho- 
toadapter on the beam-splitter of an operating micro- 
scope. Others?-!! have featured photographic meth- 
ods applied with varying degrees of success, mostly 
to direct laryngoscopy and to a lesser degree to indi- 
rect laryngoscopy. 


Hopkins rigid rod-lens wide-angle telescopes! in- 
crease light transmission, magnify the image, and 
provide an almost infinite depth of focus. The optics 
in these telescopes offer an exciting method of both 
still and dynamic documentation — a method that 
can be applied wherever telescopes are used. Many 
papers, atlases, and textbooks!3-20 have featured the 
results of pictorial photographic techniques using 
both flexible and rigid fiberoptic instruments. 


The advent of still and video digital photographic 
technology begins a new era. It is timely to present 
the author’s experience in the growing field of still 
digital photography as applied to direct and indirect 
laryngeal documentation. The advantages and disad- 
vantages of the different systems that are available 





and their most appropriate applications will be dis- 
cussed. 


MATERIALS AND METHODS 


Cameras. The standard technique for laryngeal 
photography!!.17 uses a single-lens-reflex (SLR) cam- 
era and 35-mm color film. Digital cameras, however, 
do not require film, because the photographic infor- 
mation is captured and stored on a memory card, of 
which there are different types (CompactFlash, Smart- 
Media, Memory Stick, etc) for different cameras. Un- 
wanted images can be quickly deleted from these 
cards to’ preserve available memory space. Digital 
still cameras?! are commonly classified as either com- 
pact “point-and-shoot” cameras or more expensive 
SLR cameras. For this study, the Sony DSC-S70 (Fig 
1A) and the Canon EOS-D30 (Fig 1B,C) were cho- 
sen to represent these two formats. 


The Sony DSC-S70 features a 3x optical zoom lens, 
automatic exposure control, and automatic white bal- 
ance, uses a Memory Stick recording medium, has a 
preview-review liquid crystal display (LCD) screen, 
is powered by a 7.2-V lithium ion battery, and weighs 
approximately 420 g. It has a charge-coupled device 
(CCD) image sensor with 3.3-megapixel capability. 
There are 6 preset focus zones plus an automatic fo- 
cus setting, but there is no manual focus. A Sony ex- 
tension ring VAD-S70 with a 52- to 55-mm step-up 
ring and a 55-mm Karl Storz quick-connect adapter 
560QC are required for attachment to the proximal 
end of a Hopkins telescope (Fig 1A). 


The Canon EOS-D30 camera body features auto- 
matic and manual exposure settings and automatic 
white balance, uses a CompactFlash recording medi- 
um, has a review-only LCD screen, is powered by a 
7.4-V lithium ion‘battery, and weighs approximately 
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800 g (without lens). It has a complementary metal 
oxide semiconductor (CMOS) image sensor with a 
3.25-megapixel capability. The compact, relatively 
lightweight (for an SLR camera) camera body readily 
accepts interchangeable Canon lenses. Two lenses 
were used in this study. The first was a focusable 
fixed-focal-length Canon 50-mm (equivalent to 80- 
mm on a regular 35-mm camera) lens that required a 
52- to 55-mm step-up ring and a 55-mm Karl Storz 
quick-connect adapter 560QC (Fig 1B). The second 
was the well-known focusable Karl Storz 70- to 140- 
mm zoom lens that attached directly to the proximal 
end of a Hopkins telescope (Fig 1C). 


Recording Format. All photographs were taken in 
JPEG (Joint Photographic Experts Group) compres- 
sion recording format with an image size (an indica- 
tion of image resolution) of either 2,048 x 1,536 pix- 
els or 1,600 x 1,200 pixels for the Sony (using Mem- 
ory Stick) and 2,160 x 1,440 pixels for the Canon 
(using a CompactFlash card). 

Telescopes. For direct laryngoscopy, rigid rod-lens 
telescopes 0°, 10-mm external diameter (Karl Storz 
8701AG or 8711AA) were used for most photographs, 
and smaller-diameter 4.0-mm or 5.6-mm telescopes 
were used for pediatric patients. Telescopes angled 
at 30° were sometimes used. For indirect laryngos- 
copy in adults,” a rigid rod-lens telescope, 70°, 8 x 
9-mm-—external diameter (Karl Storz 8706CA) was 


Fig 1. Various camera setups. A) 
Sony DSC-S70 “point-and-shoot” 
camera with extension ring, step- 
up ring, and quick-connect adap- 
ter for attachment to rigid tele- 
scope. B) Canon EOS-D30 “‘sin- 
gle-lens-reflex” camera body with 
50-mm fixed-focal-length Canon 
lens, step-up ring, and quick-con- 
nect adapter for attachment to 
rigid telescope. C) Canon EOS- 
D30 “single-lens-reflex” camera 
body with Karl Storz 70- to 140- 
mm zoom lens for attachment di- 
rectly to rigid telescope. 





used in this study, but any of the telescopes in rou- 
tine use would be satisfactory. 


Laryngoscopes. For an overall view of the laryn- 
gopharynx at direct laryngoscopy under general an- 
esthesia, where possible, an adult Lindholm laryngo- 
scope or one of the two Benjamin-Lindholm pediat- 
ric laryngoscopes was used. In patients in whom ex- 
posure was difficult, an appropriate laryngoscope was 
chosen according to personal preference. 


Lighting. For 35-mm film endoscopic photogra- 
phy, synchronized electronic flash with automatic ex- 
posure control is required.!!-!7 Electronic flash can 
also be used with digital cameras, but because digi- 
tal cameras have responsive light sensors, they work 
well with a modern, bright light source — xenon (pre- 
ferably) or halogen — to provide continuous avail- 
able light; there is then no need for expensive flash 
apparatus. 


White Balance. The proportions of the primary col- 
ors red, green, and blue vary according to the color 
temperature of the light source. White balance is a 
system of color correction to adjust the brightest part 
of the scene as a white point and a basis for different 
light conditions to ensure a natural color range in 
the captured image. Digital cameras provide a num- 
ber of choices to set white balance. In this study the 
automatic white balance setting was allowed to se- 
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lect the correct color temperature for pictures of the 
mucosal surfaces of the larynx during use of any of 
the lighting methods described above. 


Exposure. Exposure depends on a combination of 
3 factors: shutter speed (here usually chosen arbi- 
trarily as 1/90 second), film speed (here usually cho- 
sen as ISO 200), and the brightness of the light (which 
is varied by a control knob on the light source). At 
times, one or more of these factors were adjusted to 
improve exposure. After a photograph was taken, the 
captured image was reviewed on the camera LCD 
screen, and to ensure satisfactory exposure, changes 
were made when needed in subsequent photographs. 
Initial trial and error quickly provided experience to 
achieve correct exposure. Bracketing of exposure 
above and below what seems satisfactory is an op- 
tion with the Canon D30, which features automatic 
exposure bracketing as a selection in the dial-up menu. 
For 3 successive frames, the camera will automati- 
cally change the exposure level within a range up to 
plus or minus 2 stops in half-stop increments in the 
following order: correct exposure, underexposure, and 
overexposure. Unwanted photographs are simply de- 
leted. 


The International Standards Organization (ISO) 
speed rating is a well-known index of photosensitiv- 
ity of regular silver-halide film. Although digital cam- 
eras do not use film, the photosensitivity of their im- 
aging sensor is nevertheless commonly described as 
having an “ISO equivalent” setting. The Canon D30 
can be altered in the range from 100 through 1,600 
ISO. Large-diameter telescopes and bright light 
sources allow settings of ISO 100 or 200 to be used 
effectively. The lower the speed setting, the sharper 
the image. The higher the setting, the more “noise” 

(or electrical interference), and the less sharp the im- 
age (Fig 2B). 

Focus. It was found that focus was judged best 
with the Sony DSC-S70 used in the “automatic” mode 
(rather than any of the other 6 focus settings) and by 
reference to the preview-review screen. As the study 
progressed, it became apparent that because of the 
absence of a manual mode, focusing with the Sony 
was imprecise and difficult to control, although when 
the field happened to be in focus, an acceptable pho- 
tograph (Fig 2A) was obtained. Once this unpredict- 
ability was recognized and confirmed, further work 
with the Sony was abandoned in favor of the reliabil- 
ity and sense of sureness that had come with further 
experience with the Canon SLR. 


Because the exact field to be photographed was 
seen in the pentaprism viewfinder, focus was more 
precise with the SLR camera. Rigid telescopes have 
a wide depth of field, but in addition, the operator 


needs to use the normal manual focus ring on the 
lens for fine focus. The Canon camera, as it comes 
from the factory, is fitted with a frosted focusing 
screen. To assist clear viewing at endoscopy, it is bet- 
ter to replace this with a plain screen to provide op- 
timum brightness in the viewfinder and enhance the 
ability to focus accurately. Our original screen was 
modified by a precision lens maker. 


Printing. The digital image information was down- 
loaded to a computer either directly from the camera 
via an appropriate connection cord or by removing 
the memory card from the camera and placing it in a 
card-reader for downloading images. Special high- 
gloss photo-quality ink-jet paper is necessary for best 
results. 


Image Manipulation. The images captured by digi- 
tal cameras are likely to benefit from some degree of 
adjustment of contrast, brightness, color, and hue and 
by application of “sharpening” provided by a good 
photo retouching software program. However, this 
article is not the place to describe the many software 
printing programs that are available. The reader 
should choose a photo image-editing application ac- 
cording to personal requirements; it does not need to 
be a program with advanced features. All digital cam- 
eras come supplied with some kind of editing pro- 
gram. A high-resolution ink-jet printer is required for 
best results. Images like those in Fig 2 can be adjusted 


- in Adobe Photoshop and printed in Microsoft Pub- 


lisher on a Hewlett Packard Deskjet 990Cxi printer. 


RESULTS 


Selected examples of digital photographs taken at 
indirect and direct laryngoscopy in pediatric and adult 
patients are shown in Fig 2. Good images were ob- 
tained with the Canon SLR camera (Fig 2B-H) with 
both the Karl Storz and the Canon lenses using man- 
ual focus; a summary of the method is shown in the 
Table. In low-light conditions with a speed setting 
of 1,600 ISO, unwanted “noise” is seen (Fig 2B). It 
proved difficult to consistently ensure sharp focus 
with the Sony point-and-shoot camera, which has au- 
tomatic focus but no manual focus (Fig 2A). It seems 
likely, however, that good results could be obtained 
with one of the many other point-and-shoot cameras 
that do have manual focus capability. 


DISCUSSION 


Certain advantages and disadvantages of digital 
photography can be identified. Important intraopera- 
tive and data processing advantages include compat- 
ibility with current lighting equipment, ease of adjust- 
ment of settings, instant playback and review, deliv- 
ery of a printed image within minutes, capacity to 
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RECOMMENDED METHOD FOR STILL DIGITAL 
PHOTOGRAPHY 


Canon D30 SLR camera body 
Lens 








Karl Storz 70- to 140-mm zoom 
Canon 50-mm fixed focal length and quick-connect 
Telescope 
10 mm, 0° for direct laryngoscopy 
Choose high image resolution setting 
Optimize laryngeal exposure 
Exposure 
Bright continuous xenon (best) or halogen light or flash 
'/o0-second exposure time 
ISO 200 film speed setting 
Frame, manually focus, and take photograph 
Examine review screen and adjust exposure factors 





enter many images on a large file size memory card, 
and prompt electronic integration into patient’s notes 
and hospital records. Other advantages include im- 
mediate deletion of unwanted information; rapid per- 
manent storage and filing for record-keeping, organi- 
zation and recall; versatility to save, print, e-mail, and 
transfer into various programs such as Powerpoint; 
imprint of data regarding time, date, exposure de- 
tails and file number; facility to edit, crop, alter size, 
color, contrast, etc, of the original image; and no need 
for film or chemical processing. 


Important disadvantages include the initial capital 
cost of the digital camera, computer, software, and 
printer, the reality that the “best” system incorporat- 
ing any of these items may become outdated in a 
short time, and the learning curve necessary for com- 
puter literacy. Other disadvantages include the cost 
of inks and photo-quality paper, the fact that a digi- 
tal image can be “lost,” the water-solubility of printed 
images unless special water-resistant paper or lami- 
nation is used, and the fact that digital photographs 
appear unlikely to be accepted in a court of law. 


The capital outlay for digital equipment is out- 
weighed over time by substantial savings on film, 
photographic processing, and filing. Storage of digi- 
tal images on a recordable CD that holds up to 650 
megabytes (more than 100 high-resolution images) 
is inexpensive, convenient, safe, and long-lasting. 
Moreover, the stored images do not undergo the de- 
terioration of picture quality that can occur with slide 
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Fig 3. Flow chart emphasizes importance of archival im- 
age. 


transparencies because of fading of colors, humid- 
ity, and fungal growth. 


So far, 35-mm film photography maintains its abil- 
ity to produce a marginally superior image. With digi- 
tal photography, the definition is not quite as sharp 
and is often described as “near 35-mm quality.” In 
some instances, digital images may not be as good 
as 35-mm photographs for publication, teaching, re- 
search, lecture presentation, or display of a poster. 
However, because digital images can be manipulated 
and presented so easily, lectures are often generated 
from a laptop or personal computer with Powerpoint 
or similar software and projected by an LCD projec- 
tor direct to a screen. In addition, journal editors and 
publishers will increasingly accept digitally stored 
images without the need to make hard prints. After 
all, such prints are scanned into digital media at the 
pre-press stage during publication. 


Camera resolution much beyond 2 megapixels is 
not distinguishable by the human eye on a computer 
screen, and at this level “pixellation” is visually de- 
tectable only in a printed image larger than A4 size 
(210 x 297 mm). It should be noted, therefore, that 
as consumer digital cameras push on to 5 megapixel 
and more capability, it is not entirely necessary to 
upgrade capacity to these levels for medical photo- 
documentation. 


Digital photodocumentation with a miniature tele- 


< Fig 2. Digital photographs. A) Taken with Sony DSC-S70. B-J) Taken with Canon EOS-D30. K) Taken with AIDA system. 
A) Laryngeal papillomas in small child; reasonably good focus with Sony camera. B) Hypoplasia of epiglottis in infant. With 
Canon at film speed of 1,600 ISO, shows “noise” (interference) pattern. C) Close-up of congenital interarytenoid web in 
infant. D) Normal structures at indirect laryngoscopy (reoriented with anterior commissure at top). D1) Base of tongue and 
valleculae. D2) Epiglottis and posterior larynx. D3) Glottis and anterior subglottis. E) Mass of laryngeal papillomas. F) Close- 
up of small papilloma on right arytenoid area. G) Small polyp on left membranous vocal fold. H) Severe glottic web following 
injudicious removal of laryngeal papillomas. I) Posterior glottic stenosis caused by prolonged intubation trauma. J) Squamous 
cell carcinoma. K) Overall view of larynx with Benjet anesthesia tube in place. Posterior glottic stenosis is present, but cannot 
be seen in this view taken with Karl Storz AIDA photodocumentation video system. 
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vision camera attached to either a flexible or a rigid 
endoscope is already widely used in clinics, offices, 
and operating rooms. Prints can be obtained from an 
image “frozen” on the video screen or from recorded 
material. During operations such as microlaryngos- 
copy or endoscopic sinus surgery, it is convenient to 
simply print from a “real-time” image on the televi- 
sion monitor or to record continuously on videotape 
or CD. The procedure does not need to be interrupted 
to obtain still photographs. With a good 3-chip CCD 
television camera, recorded. still images are good 
enough for most purposes, including presentation on- 
to a screen and publication. 


A number of systems are now available that print 
digital images directly to a patient record sheet. The 
system familiar to the author is the Advanced Image 
and Archiving (AIDA) System made by Karl Storz. 
It consists of a 3-chip miniature camera and televi- 
sion monitor linked to a Pentium III processor with 
mouse, keyboard, recordable CD, color printer, and 
supplied software to edit and produce still images 
(Fig 2K). The database records patient-specific de- 
tails, history, and investigation results and has a facil- 
ity to read-in data such as digitized x-ray or ultra- 
sound images. During a surgical procedure, notes are 
recorded and selected images are imprinted before 


the patient leaves the operating room. Such a system 
facilitates immediate archiving of one or more oper- 
ative endoscopic images, which can be conveniently 
stored, retrieved, e-mailed, incorporated into a report 
or letter, or accessed later for presentation. Storage 
options include the computer hard drive, floppy disk, 
zip drive, or recordable CD. The latter is cheap, con- 
venient, and portable and appears to be the best cur- 
rent option. 


This type of system is the most useful and practi- 
cal in operating rooms and in clinics. Whether its 
source is a video camera or a still camera, the cap- 
tured archival image is the critical centerpiece of digi- 
tal photography (Fig 3) and, as already described, 
can be used in many ways. 


Digital photodocumentation with a small televi- 
sion camera, considering its ready availability, con- 
venience, and all-round versatility, is the way of the 
future. Still photographs obtained with a 3-chip cam- 
era are suitable for routine purposes and are often 
good enough for publication. Still digital photogra- 
phy with an SLR camera, as detailed in this paper, is 
not likely to be extensively used except by those who 
seek excellence in photographic documentation. An 
SLR camera using either 35-mm film or digital re- 
cording media currently provides the best image. 


REFERENCES 


1. French TR. On a perfected method of photographing the 
larynx. NY State J Med 1884;40:653-6. 


2. Garel J. Nouvel appariel perfectionee pour la photo- 
graphie stereoscopique du larynx sur le vivant. Rev Laryngol 
1919;40:249-53. 


3. Clerf LH. Photography of the larynx. Ann Otol Rhinol 
Laryngol 1925;34:101-21. 

4. Holinger PA. Photography of the larynx, trachea, bronchi 
and esophagus. Trans Am Acad Ophthalmol Otolaryngol 1942; 
46:153-6. 

5. Holinger PA, Brubaker JD, Brubaker JE. Open tube, prox- 


imal illumination mirror and direct laryngeal photography. Can 
J Otolaryngol 1975;4:781-5. 


. 6. Kleinsasser O. Microlaryngoscopy and endolaryngeal mi- 
crosurgery. Philadelphia, Pa: WB Saunders, 1968. 


7. Tardy ME Jr, Tenta LT. Laryngeal photography and tele- . 


vision. Otolaryngol Clin North Am 1970;3:483-92. 
8. Edens ET. Bronchoscopy in pictures. Bracknell, England: 
Boehringer Ingelheim, 1972. 


9, Padovan IF, Christman NT, Hamilton LH, Darling RJ. 
Indirect microlaryngostroboscopy. Laryngoscope 1973;83:2035- 
4l. 


10. Ward PH, Berci G, Calcaterra TC. Advances in endoscop- 
ic examination of the respiratory system. Ann Otol Rhinol Lar- 
yngol 1974;83:754-60. 


11. Benjamin B. Technique of laryngeal photography. Ann 


Otol Rhinol Laryngol Suppl 1984;93(suppl 109). 


12. Hopkins HH. Optical principles of the endoscope. In: 
Berci G, ed. Endoscopy. New York, NY: Appleton-Century- 
Crofts, 1976:3-26. 


13. Benjamin B. Atlas of paediatric endoscopy: upper respi- 
ratory tract and oesophagus. London, England: Oxford Univer- 
sity Press, 1981. 


14. Becker W, Buckingham RA, Holinger P, Steiner W, Jau- 
mann MP, Messerklinger W. Atlas of otorhinolaryngology and 
bronchoesophagology. Philadelphia, Pa: WB Saunders, 1984. 


15. Benjamin B. Diagnostic laryngology: adults and children. 
Philadelphia, Pa: WB Saunders, 1990. 

16. Yanagisawa E, Yanagisawa R. Laryngeal photography. 
Otolaryngol Clin North Am 1991;24:999-1022. 


17. Benjamin B. Art and science of laryngeal photography. 
Ann Oto! Rhinol Laryngol 1993;102:271-82. 


18. Benjamin B, Bingham B, Hawke M, Stammberger H. A 
color atlas of otorhinolaryngology. London, England: Martin 
Dunitz, 1995. 


19. Yanagisawa E. Color atlas of diagnostic endoscopy in 
otorhinolaryngology. New York, NY: Igaku-Shoin, 1997. 


20. Benjamin B. Endolaryngeal surgery. London, England: 
Martin Dunitz, 1998. 


21. http://www.imaging-resource.com. Accessed 2002. 
22. Benjamin B. Indirect laryngeal photography using rigid 
telescopes. Laryngoscope 1998;108:158-61. 





Ann Otol Rhinol Laryngol 111:2002 


HISTOPATHOLOGIC STUDY OF THE TEMPORAL BONES AND 
EUSTACHIAN TUBES OF CHILDREN WITH CHOLESTEROL 
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Six temporal bone-eustachian tube (ET) specimens with cholesterol granuloma (CG), obtained from 6 children 6 months to 15 
years of age, were studied histopathologically to obtain further information about the pathogenesis of CG. We observed CG in the 
mastoid air cells in 5 ears, the mastoid antrum in 1 ear, the aditus ad antrum in 2 ears, and the epitympanum in 1 ear. All 6 cases 
exhibited a large amount of remaining mesenchyme that was in continuity with the hematopoietic bone marrow in the locations in 
which CG was present. All cases demonstrated otitis media with effusion and inflammation of the ET. Apparent morphological 
abnormalities of the ET and its associated structures, including hypoplastic ET cartilage and an abnormal tensor veli palatini muscle, 
were noted in 3 of the 6 cases. Furthermore, the posterior cartilaginous portion of the ET that includes its narrowest portion was 
completely filled with effusion in 2 of the 3 cases with the ET anomaly. The findings obtained were compared with data from age- 
matched control cases. Our results suggest that the source of erythrocytes in the remaining mesenchyme is the hematopoietic bone 
marrow. The pathogenesis of CG in children is likely promoted by ET dysfunction resulting in failure of ventilation of the middle ear. 


KEY WORDS — cholesterol granuloma, eustachian tube, histopathology, mesenchyme, temporal bone. 


INTRODUCTION 


Cholesterol granuloma (CG) is a histologic term 
used to describe a chronic granuloma that represents 
a foreign body response to the aggregation of cho- 
lesterol crystals resulting from the breakdown of 
erythrocytes and certain tissue elements.! Although 
CG is found in various organs, the temporal bone is 
the most common site of detection. Cholesterol gran- 
` uloma of the temporal bone is frequently associated 
with various diseases of the ear, including otitis me- 
dia with effusion and cholesteatoma,!-3 and may be 
a difficult condition to treat. Although the pathogen- 
esis of CG remains controversial, hemorrhage, in- 
terference with clearance or drainage, and obstruc- 
tion of air exchange or ventilation of the middle ear 
(ME) system are considered to be potential impor- 
tant factors for the development of CG.! Remaining 
mesenchyme has also been reported to possibly be 
associated with the formation of CG.1-4 


The ME is known to have 2 pressure regulation 
systems: one is ventilation through the eustachian 
tube (ET), and the other is gas exchange through the 
ME, mainly via the mucosa of the mastoid cavity.>6 
The ET is the predominant pathway for clearance of 
the ME. In order to investigate the pathogenesis of 
CG, a comprehensive evaluation of the ME system, 


including the ET and mastoid cavity, is necessary. 


There are many histopathologic reports describ- 
ing CG of the temporal bone.!.2.4,78 However, little 
is known about the histopathologic findings of the 
ET and its associated structures in cases of CG. The 
purpose of this study is to present the results of his- 
topathologic examination of temporal bone—ET spec- 
imens from 6 children with CG and 8 age-matched 
controls, and to discuss the possible pathogenesis of 
this entity. 


MATERIALS AND METHODS 


We studied 60 temporal bone~ET specimens ob- 
tained at autopsy from 60 children 15 years of age or 
younger. Twenty-six of the 60 specimens exhibited 
otitis media on histologic analysis. Among the 26 
specimens, 6 contained CG; these are the focus of 
this study. The specimens were obtained from 6 chil- 
dren 6 months to 15 years of age. In addition, 8 speci- 
mens without otitis media, obtained from 8 children 
3 months to 12 years of age, were used as age-matched 
(as matched as possible) controls. They did not have 
any known anomaly or histologic evidence of ear 
diseases. Their.clinical information is summarized 
in Table 1. The specimens were fixed in 10% forma- 
lin, decalcified in 5% trichloroacetic acid, dehydrated 





From the Elizabeth McCullough Knowles Otopathology Laboratory, Division of Otopathology (Miura, Sando, Orita), and the Division of 
Otology (Hirsch), Department of Otolaryngology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania. This study was 
supported by research grant RO1DC00123-23 from the National Institutes of Health/National Institute on Deafness and Other Communication 


Disorders. 


CORRESPONDENCE — Isamu Sando, MD, DMSc, Division of Otopathology, Dept of Otolaryngology, University of Pittsburgh School of 
Medicine, Suite 153, Eye and Ear Institute Building, 203 Lothrop St, Pittsburgh, PA 15213. 


610 Miura et al, Histopathology of Cholesterol Granuloma 


TABLE 1. CASES OF CHOLESTEROL GRANULOMA AND CONTROLS 





Case Age 





Sex Side Cause of Death 








Otologic History 


Accompanying Disease 








Cholesterol granuloma 


l 6 mo M L Post hepatic transplantation 


2 6mo M R Cervical teratoma 

3 17mo F L Hydranencephaly 

4 Ay M L Hydrocephalus 

ae) PY F L Post hepatic transplantation 


6 I5y F L Chronic renal failure 


Controls 
T. -3me F L Sudden infant death syndrome 
8 8mo F L Septic shock 
9 12mo M L Cardiomyopathy 
10. 1l6mo M L Acute lymphocytic leukemia 
Il 4y M iL Acute lymphocytic leukemia 
12 Sy M L Acute myocarditis 
13. 8y M L Acute lymphocytic leukemia 
l4 12y M L Intracerebral hematoma 


in graded concentrations of ethanol, and embedded 
in celloidin. The celloidin blocks were sectioned at 
30 um in thickness as close to perpendicular to the 
long axis of the ET as possible.? Every 20th section 
was stained with hematoxylin and eosin and prepared 
for light microscopic study. 

We examined these sections under a light micro- 
scope to record pathological findings in each of the 
following areas of the ME and ET: 1) the mastoid air 
cells, 2) the mastoid antrum, 3) the aditus ad antrum, 
4) the epitympanum, 5) the mesotympanum and hy- 
potympanum, 6) the bony portion of the ET, and 7) 
the cartilaginous portion of the ET. The aditus ad 
antrum is defined by 2 (anterior and posterior) verti- 
cal planes perpendicular to the longitudinal axis of 
the ME cleft. The anterior plane runs through the 
mid-portion of the short process of the incus and de- 
fines the anterior border of the aditus ad antrum and 
the posterior border of the epitympanum. The poste- 





Extrahepatic biliary atresia 
Congenital syphilis 

Multiple congenital anomalies 
Median cleft face syndrome Recurrent otitis media 
Alagille’s syndrome 

Otitis media with effusion, 
malformed auricle 


Oral-facial-digital syndrome 


Facial palsy 


rior plane, which transects the area in which the adi- 
tus ad antrum enlarges to form the mastoid antrum, 
defines the posterior border of the aditus ad antrum 
and the anterior limit of the mastoid antrum. The 
mesotympanum and hypotympanum were distin- 
guished from the epitympanum by the horizontal 
plane running through the most superior margin of 
the pars tensa of the tympanic membrane. The bony 
portion of the ET was distinguished from the tym- 
panic cavity by a vertical plane running through the 
most anterior-superior margin of the tympanic an- 
nulus. 


We focused on the following findings in each of 
the above-mentioned areas: 1) the location of the 
CG, 2) the remaining mesenchyme, 3) the degree 
of ME effusion, 4) the degree of inflammation, and 
5) the morphological abnormalities of the ET and 
its associated structures, including the cartilage and 
the tensor veli palatini muscle (TVPM). Five levels 


Fig 1. (Case 1) Photomicrograph of left temporal 
bone section obtained from 6-month-old boy 
shows uninterrupted connection (arrows) of hema- 
topoietic bone marrow (B) with remaining mesen- 
chyme (M). 
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Fig 2. (Case 4) Photomicrographs of vertical sections of left temporal bone 
from 4-year-old boy. Cholesterol granuloma (CG) is completely surrounded 
by remaining mesenchyme (*), present from aditus ad antrum (A) to epitym- 
panum (B). Small degree of effusion is present in mesotympanum (arrow in 
B). ASC — anterior semicircular canal; F — facial nerve; I — incus; LSC 

lateral semicircular canal; TM — tympanic membrane: L utricle. Right, 
left, top, and bottom sides of images are anterolateral, posteromedial, supe 
rior, and inferior directions, respectively. 


of grading were used to express the degree of re- 
maining mesenchyme: 0 (none); 1 (1% to 20%); 2 
(21% to 40%); 3 (41% to 80%); and 4 (81% to 
100%).'!° We hypothesized that the outpouring of 
components of bone marrow, including erythrocytes, 
through the remaining mesenchyme might be one of 


the causative factors of CG. The presence or absence 
of an uninterrupted connection of the hematopoietic 
bone marrow with the remaining mesenchyme (Fig 
1) was also examined. The degree of effusion within 
the ME or ET was also graded as follows: none (al- 
most no effusion); minimal (approximately 10% to 


TABLE 2. GRADES OF REMAINING MESENCHYME AND CONNECTIONS OF BONE MARROW WITH MESENCHYME 


IN SEVEN LOCATIONS 


Mesotympanum Eustac hian Ti he 


Mastoid Mastoid Aditus Ad and Bony Cartilaginous 
Case Age Air Cells Antrum Antrum Epitympanum — Hypotympanum Portion Portion 
Cholesterol granuloma 
1 6mo 3 (+)* 3 (+) 2 (+) 2 (+) 1 (+) (+) 
2 6mo 4 (+)* 4 (-) 4(-) 4 (-) l (-) (-) 
3 17 mo 3 (+)* 2 (+)* 2)” 1 (-) (5) (+) ( 
4 4y 4 (+) 4 (+) 4 (+)* 4 (+)* 1 (-) (-) ( 
SW 3 (+)* 2 (+) 2 (+) 2 (+) 2 (-) (+) ) 
6 15 y 4 (+)* 2 (-) 1 (-) 1 (=) 1 (-) (-) 
Controls 
7 3mo 1 (+) 0 0 0 1 (-) (=) 
8&8 8mo 1 (+) 1 (+) l (-) 1 (-) 1 (-) (-) ) 
9 12 mo l (=) 0 0 0 1 (-) (—) ( 
10 16 mo 1 (-) l (-) 1 (-) 1 (-) 1 (-) (—) | 
ll 4y 1 (-) 0 0 0 1 (-) (—) 
12 Sy 1 (-) 0 0 0 l=) (-) 
13. 8y 1 (-) 0 0 0 0 0 
l4 12y 0 0 0 0 0 0 
Grades of remaining mesenchyme: 0 — none; | — 1% to 20%; 2 — 21% to 40%; 3 — 41% to 80%; 4 — 81% to 100%. Plus and minus indicate 





presence and absence of connection of hematopoietic bone marrow with remaining mesenchyme, respectively. 


*Presence of cholesterol granuloma. 
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Fig 3. (Case 1) Photomicrograph of left temporal bone 
(mastoid portion) section obtained from 6-month-old boy. 
Small cholesterol granuloma (arrow) is developing from 
contiguous overflowing components (*) of bone marrow 
(B) containing erythrocytes. 


20% of the expected aerated space); moderate (ap- 
proximately 20% to 50%); or severe (more than 
50%).!! The degree of inflammation (submucosal 
mononuclear cell infiltration) was evaluated by the 
following grading system (previously described in 
detail!!-12); grade 0 (none), with almost no infiltra- 
tion; grade I (minimal), with local, sparse infiltra- 
tion; grade II (moderate), with focal, dense infiltra- 
tion; or grade III (severe), with diffuse, dense infil- 
tration. 


RESULTS 
Cases of Cholesterol Granuloma. We observed CG 


in the mastoid air cells in 5 ears, the mastoid antrum 
in | ear, the aditus ad antrum in 2 ears, and the epi- 
tympanum in | ear. Case 3 had multiple CGs that 
were diffusely distributed from the aditus ad antrum 
to the mastoid air cells. On the other hand, case 4 
had a relatively large CG expanding and extending 
from the aditus ad antrum to the epitympanum (Fig 
2) 


a). 


Table 2 summarizes the relationship between the 
remaining mesenchyme and the location of CG. All 
6 ears had mesenchyme remaining in the temporal 
bone. In all ears, the degree of the remaining mesen- 
chyme was higher than level 2 in the location in 
which CG was present. Three of the 6 ears had CG 
that was completely surrounded by the remaining 
mesenchyme (Fig 2). Direct continuity of the hemato- 
poietic bone marrow with the remaining mesenchyme 
was noticed in all ears. Significantly, all ears exhib- 
ited the connection in the ME (including the mas- 
toid air cells and antrum) in the site at which CG 
was present. Figure 3 suggests that a small CG is 
being formed from overflowing components of bone 
marrow, especially erythrocytes. 


Table 3 summarizes the grades of effusion and de- 
grees of inflammation in 7 different locations. In all 
6 ears, the effusion was noted in the mesotympanum 
and hypotympanum and the bony portion of the ET, 
from which CG was absent. In 2 of the 6 ears (cases 
l and 5), the posterior cartilaginous portion of the 
ET was filled with effusion (Fig 4). On the other hand, 
2 of the 6 ears (cases 2 and 4) had no effusion in the 
location in which CG was present. The degrees of 
inflammation were different in each case; however, 
the following findings were noted. All 6 ears showed 
low degrees of inflammation (grade 0 or I) in the 
location of the CG. The degree of inflammation was 
generally more severe in the ET, from which CG was 
absent, than in the ME. 


Apparent abnormalities of the ET and its associ- 
ated structures were noted in 3 of the 6 ears. Despite 
the presence of a well-developed TVPM, case 1 


TABLE 3. GRADES OF EFFUSION AND DEGREES OF INFLAMMATION IN SEVEN LOCATIONS _ 


























Mastoid Mastoid Aditus Ad 

Case Age Air Cells Antrum Antrum 

l 6 mo (t+); 17 (+), I GE) 

2 6mo (—), O* (-), 0 (+), 0 

3 Imo (++), I" (++), I* (+), I* 

4 4y (—), 0 (—), 0 (—), 0* 

3 7y (+++), 0* (++), 0 (++), I 

6 I5y (=), I? (++), II (++), I 





Mesotympanum Eustachian Tube 


and Bony 





Cartilaginous 








Epitympanum Hypotympanum Portion Portion 
(++), I (++), I (+++), II (+++), II 
(+), I (++), I (+++), I ? 

(++), I (++), I (+), I (+), I 
(+), I* (+), I (++), Il (++), I 
(++), I (+++), I (+), I (+++), I 
(++), I (+++), II (+), II (-), II 


Grades of effusion: (—) — none; (+) — minimal; (++) — moderate; (+++) — severe. Degrees of inflammation, as indicated by submucosal 
round cell infiltration: 0 — none; I — minimal; II — moderate; III — severe; ? — impossible to evaluate because of artifact. 


*Presence of cholesterol granuloma. 
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Fig 4. (Case 5) Photomicrograph of vertical section of 
left eustachian tube and its associated structures obtained 
from 7-year-old girl. Eustachian tube lumen (L) is com- 
pletely occupied by effusion. Note that medial lamina 
(ML) of eustachian tube cartilage is poorly developed. 
Tensor veli palatini muscle (TVP) is attached (*) to lat- 
eral lamina (LL) of eustachian tube cartilage. LVP — 
levator veli palatini muscle. Right, left, top, and bottom 
sides of image are anterolateral, posteromedial, superior, 
and inferior directions, respectively. 


showed no attachment of the TVPM to the ET carti- 
lage (Fig 5). Case 5 had a poorly developed medial 
lamina of the ET cartilage (Fig 4). Case 6 had an 
abnormally short cartilaginous portion of the ET that 
was poorly developed. 


Controls. None of the control subjects exhibited 
CG, effusion, or inflammation in any area of the ME 
or ET. 


The amount of remaining mesenchyme is pre- 
sented, with the presence or absence of an uninter- 
rupted connection with the hematopoietic bone mar- 
row, in Table 2. A small amount of remaining mesen- 
chyme (level 1) was observed mainly in the mastoid 
air cells, the mesotympanum and hypotympanum, 
and/or the bony portion of the ET; however, the single 
subject 12 years of age (case 14) did not have re- 
maining mesenchyme in any location. A connection 
to the hematopoietic bone marrow was noted only in 
2 subjects younger than 8 months of age. 


All control cases displayed normal morphologi- 


Fig 5. (Case 1) Photomicrograph of vertical section of 
mid-cartilaginous portion of left eustachian tube and its 
associated structures obtained from 6-month-old boy 
Tensor veli palatini muscle (TVP) is not attached (ar 


row) to lateral lamina (LL) of eustachian tube cartilage 
L — lumen of eustachian tube; LVP — levator veli 
palatini muscle; ML — medial lamina of eustachian tube 


cartilage. Right, left, top, and bottom sides of image are 
anterolateral, posteromedial, superior, and inferior direc 
tions, respectively. 


cal features of the ET and its associated structures, 
including 1) a firm attachment of the TVPM to the 
lateral lamina of the ET cartilage, 2) a well-devel- 
oped medial lamina of the ET cartilage that covered 
the medial wall of the ET lumen, and 3) an appropri- 
ately long cartilaginous portion of the ET. 


DISCUSSION 


Cholesterol granuloma is a histologic description 
that can often be recognized clinically. Nager’ sum- 
marized 3 factors responsible for the development 
of CG: 1) hemorrhage, 2) interference with clear- 
ance or drainage, and 3) obstruction of air exchange 
or ventilation. The initial bleeding may result from 
hemorrhagic inflammation, a vascular disorder, or 
an injury. Obstruction at any level along the ME cleft 
may result in the interruption of both clearance and 
ventilation of the peripheral ME space. Various fac- 
tors, including remaining mesenchyme, polyps, scar 
formation in the ME,!4 acquired cholesteatoma,! or 
congenital epidermoid cyst of the petrous apex,!? can 
be expected to cause obstruction. Once CG has devel- 
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oped, regardless of what initially caused the bleeding 
and obstruction, it seems to promote its own growth 
in a vicious circle by becoming a source of subse- 
quent hemorrhage and obstruction. Among these po- 
tential causative factors, we propose that the remain- 
ing mesenchyme in continuity with hematopoietic 
bone marrow and ET dysfunction may have an im- 
pact on the pathogenesis of CG, especially in chil- 
dren. ` 


It is known that primitive mesenchyme may re- 
main in the ME after birth.'4 The remaining mesen- 
chyme is thought to regress in relation to the devel- 
opment of the ME cavity.!014.15 Takahara et al!5 re- 
ported that the remaining mesenchyme had nearly dis- 
appeared within the first year in norma] infants. On 
the other hand, several studies have observed residual 
mesenchyme in the temporal bones of adults.%16 This 
residual mesenchyme has also been reported to have 
an association with chronic ME inflammation, in- 
cluding silent otitis media!’ and cholesteatoma.!8 In 
the present study, all 6 patients with CG were younger 
than 15 years of age and had a large amount of mes- 
enchyme remaining in the ME space in which CG 
was present. In particular, the CG was completely 
surrounded by mesenchyme in 3 cases. In addition, 
in 5 of the 6 ears with CG, the CG was located in the 
mastoid air cells, in which remaining mesenchyme 
would be most likely to persist.!© Although control 
subjects younger than 8 years of age exhibited a small 
amount of remaining mesenchyme, the amount was 
apparently lower than in the cases of CG. We specu- 
late that when the ME space (including the mastoid 
antrum and air cells) is surrounded by residual mes- 
enchyme, complete gas exchange through the mucosa 
is hampered. The additional insult of ET dysfunc- 
tion contributes to irreversible negative ME pressure. 
This negative pressure may result in hemorrhage from 
the mucosa (or mesenchyme), which further compro- 
mises gas exchange. Despite normal ET function, the 
remaining mesenchyme may still block the epitym- 
panum, antrum, and peripheral mastoid air cells. Our 
results suggest that a large amount of mesenchyme 
may interrupt the gas exchange through the ME mu- 
cosa.and/or mechanically obstruct the ME space. The 
presence of remaining mesenchyme may be instru- 
mental in the formation and location of CG, especial- 
ly in children. 


Linthicum et al!9 reported that fetal and newborn 
temporal bones had a direct connection between the 
hematopoietic bone marrow and the unresolved mes- 
enchyme in the ME. They believed that this connec- 
tion provided a supply of leukocytes that might ac- 
count for the low incidence of severe acute otitis me- 
dia in newborns. Although the connection has been 
reported to disappear by the age of 19 months,!9 all 


6 cases of CG, including 3 in subjects older than 3 
years of age, had the connection in the ME in which 
CG was present. However, among the controls, only 
2 subjects younger than 8 months of age displayed 
the connection. In addition, we found a histologic 
section with an outpouring of bone marrow compo- 
nents, including erythrocytes, suggesting early for- 
mation of CG (Fig 3). We speculate that persistent 
negative ME pressure may easily produce hemor- 
rhage from the mesenchyme that contains the rich 
source of erythrocytes. The persisting uninterrupted 
connection of the hematopoietic bone marrow with 
the remaining mesenchyme provides the hemorrhagic 
source, which may be important in the pathogenesis 
of CG. 


Eustachian tube dysfunction is generally accepted 
as an essential causative factor of CG.! Normally, 
the ET is opened by contraction of the TVPM pull- 
ing inferolaterally on a well-developed lateral lamina 
of the ET cartilage.2° There is limited information 
on histopathologic findings of the entire ET and its 
associated structures in cases of CG. Significantly, 
this study demonstrated that all subjects with CG had 
inflammation of the ET, as well as the ME space. 
Three of the 6 subjects with CG had morphological 
abnormalities of the ET, including hypoplastic carti- 
lage and an abnormal TVPM. These abnormalities 
may play an important role in disturbing effective 
ET opening and may predispose the ME air system 
to obstruction resulting in formation of CG. Further- 
more, in 2 of the 6 cases of CG, the posterior carti- 
laginous portion of the ET, including its narrowest 
portion,?! was completely filled with effusion. We 
speculate that such an ET cannot easily ventilate the 
ME cavity, compromising aeration of the peripheral 
mastoid air cells despite normal anatomy. It is un- 
clear whether ET obstruction is the initiating cause 
of CG or a subsequent result of this entity. There 
probably is a vicious circle of further deterioration 
of ME ventilation that promotes the development of 
CG. 


Cholesterol granuloma of the temporal bone is con- 
sidered to be a sequela of chronic otitis media with 
effusion.3 We found CG to be located most frequently 
in the mastoid air cells. In this study, 2 of the 6 sub- 
jects with CG had a clinical history of otitis media. 
All 6 cases of CG demonstrated various amounts of 
effusion, especially in the mesotympanum and hypo- 
tympanum and the bony portion of the ET. It is in- 
teresting to observe that CG was absent from these 
locations. This finding suggests that the development 
of CG does not require the immediate presence of 
effusion. The effusion may indirectly promote the for- 
mation of CG by obstructing the narrow ME spaces, 
including the ET lumen and isthmus. On the other 
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hand, we could not confirm a correlation between 
the degree of inflammation and the development of 
CG. Kamimura et al?? reported that inflammatory 
cell infiltration was observed in the ET in subjects 
without otitis media. More severe inflammation, de- 
fined as submucosal lymphocyte infiltration, was ob- 
served in the ET, rather than in the ME, and may be 
reflective of an active local immunologic ET defense 
system against infection. 


Because all 6 subjects with CG in this study had 
at least 1 systemic disease or congenital anomaly, 





these results may not reflect pathological findings 
of CG with otitis media in otherwise healthy chil- 
dren. For example, several articles!®23 have reported 
that the resorption of remaining mesenchyme is de- 
layed in patients with various congenital anomalies 
and syndromes. Therefore, further histopathologic 
study of additional cases of CG is necessary to es- 
tablish the pathogenesis. However, we believe that 
our pathological findings can still provide otolaryn- 
gologists, head and neck radiologists, and temporal 
bone histopathologists with important information 
regarding the pathogenesis of CG. 
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PATHOPHYSIOLOGY OF FACIAL NERVE PARALYSIS INDUCED BY 
HERPES SIMPLEX VIRUS TYPE 1 INFECTION 
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Herpes simplex virus type 1 (HSV-1) has been proven to be a cause of Bell’s palsy; however, the underlying pathophysiology of 
the facial nerve paralysis is not fully understood. We established a mouse model with facial nerve paralysis induced by HSV-1 infec- 
tion simulating Bell’s palsy and investigated the pathophysiology of the facial nerve paralysis. The time course of the R1 latency in 
the blink reflex tests paralleled the recovery of the facial nerve paralysis well, whereas electroneurographic recovery tended to be 
delayed, compared to that of the paralysis; these responses are usually seen in Bell’s palsy. On histopathologic analysis, intact, 
demyelinated, and degenerated nerves were intermingled in the facial nerve in the model. The similarity of the time course of facial 
nerve paralysis and the electrophysiological results in Bell’s palsy and the model strongly suggest that the pathophysiological basis 


of Bell’s palsy is a mixed lesion of various nerve injuries. 


KEY WORDS — blink reflex test, electroneurography, facial nerve paralysis, herpes simplex virus type 1, mixed nerve injury. 


INTRODUCTION 


Evidence for a viral causation of herpes simplex 
virus type 1 (HSV-1) in Bell’s palsy has been increas- 
ing in recent years.!-+ However, details of the patho- 
physiological mechanism of the facial nerve paraly- 
sis induced by HSV-1 infection have not yet been 
fully elucidated. We established a mouse model of 
acute and transient facial neuritis induced by HSV-1 
infection and demonstrated that the model simulated 
Bell’s palsy.>-? Using the model, we obtained histo- 
pathologic and immunologic findings that help to ex- 
plain the pathological mechanism of HS V-1—induced 
facial nerve paralysis and are relevant to Bell’s palsy. 


This study was designed to elucidate the patho- 
physiological mechanism of the facial nerve paraly- 
sis in this animal model. Blink reflex tests and electro- 
neurography (ENoG) were carried out at two differ- 
ent stages of the facial nerve paralysis, with imme- 
diate histopathologic examinations of the facial nerve 
by both light and electron microscopy. We obtained 
data suggesting the pathophysiological basis of Bell’s 
palsy. The results also indicate the usefulness and 
limitations of the two tests for predicting the prog- 
nosis of the paralysis and provide a key to under- 
standing the contradiction seen between the results 
of the two electrodiagnostic tests and the grade of 
facial nerve paralysis in the clinical diagnosis of 





Bell’s palsy. 


MATERIALS AND METHODS 


Experimental Animals. We used 4-week-old fe- 
male Balb/c mice for this experiment. All of the ani- 
mals were kept in our Laboratory Animal Center, and 
the studies complied with Ehime University School 
of Medicine’s guidelines for animal experimentation. 
General anesthesia was induced with intraperitoneal 
injection of sodium pentobarbital (50 mg/kg), and 
25 mL of the KOS strain of HSV-1 with a titer of 6.7 
x 10’ plaque-forming units per milliliter was inocu- 
lated into the posterior aspect of the right auricle af- 
ter scratching the skin surface 20 times with a 27- 
gauge needle. In 6 control animals, phosphate buffer 
solution was applied instead. Of the 196 mice inocu- 
lated, 106 animals (54%) developed facial nerve pa- 
ralysis exclusively on the side of inoculation. The 
facial nerve paralysis developed 6 to 9 days after 
inoculation and continued for a week, and then the 
mice recovered spontaneously. 


Recording Blink Reflex and ENoG. Electrophys- 
iological blink reflex and ENoG tests were conducted 
under general anesthesia with sodium pentobarbital 
(50 mg/kg) on a signal processor (NEC Sinax 1100). 
In the blink reflex tests, the supraorbital nerve was 
stimulated with a 0.1-ms square wave electric pulse, 
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Fig 1. Blink reflexes elicited from representative animal 
with transient facial nerve paralysis (normal and para- 
lyzed sides). 


2.0 mA in amplitude, via a bipolar surface electrode; 
the myogenic response was recorded from a needle 
electrode inserted into the ipsilateral orbicularis oculi 
muscle. A ground electrode was positioned on the pa- 
rieto-occipital muscle. The R1 latency was measured 
from the stimulus artifact to the beginning of the ini- 
tial deflection of the evoked potential. The blink re- 
flex was examined on the 10th day after inoculation 
in 15 animals with severe facial nerve paralysis. In 
12 of the 15 animals, R1 could be elicited, whereas 
in the remaining 3 animals, it could not, even after 
increasing the stimulus current. After completing the 
recording, we sacrificed the animals for histologic 
examination. On the 17th day, the blink reflex was 
examined in another 12 animals, which by then had 
recovered from the facial nerve paralysis, and R1 was 
elicited in all 12 animals. The latencies of R1 in blink 
reflex tests were recorded 3 times for each animal, 
and the mean value was used for the analysis. The 
difference in the latencies of R1 between the para- 
lyzed and normal sides was assessed with a sign test. 


ForENoG recording, the extratemporal facial nerve 
trunk posterior to the auricle was carefully exposed 
and subjected to supramaximal stimulation with a bi- 
polar electrode. The myogenic response was recorded 


AVERAGE R1 LATENCY OF BLINK REFLEX 
R1 Latency (ms) 

















Side Day 10 Day 17 
Normal 9.9412 10.64 1.8 
Paralyzed 15.741.7 11.9412 


There was significant difference in R1 latency between paralyzed 
and normal sides on 10th day after inoculation (p < .05). In contrast, 
no significant difference in latency of R1 was seen on 17th day. 





from the ipsilateral orbicularis oris muscle with sur- 
face electrodes. A ground electrode was inserted into 
the parieto-occipital muscle. The amplitude of the 
ENoG was measured from the first negative peak to 
the maximum positive peak. The ENoG value was 
defined as the ratio of the amplitude on the affected 
side to that on the normal side. The ENoG was re- 
corded on the 10th, 17th, and 90th days in 8 experi- 
mental animals and 6 control animals. The record- 
ings were repeated 3 times, and the average of 3 tri- 
als was used for the analysis. The results were com- 
pared by the Mann-Whitney test. 


Histopathologic Examination. Under deep anes- 
thesia with sodium pentobarbital, the animals were 
sacrificed by transcardial perfusion with 2.5% glu- 
taraldehyde. The extratemporal portions of the fa- 
cial nerve were histologically examined on the 10th 
and 17th days after inoculation in 8 animals each. 


50 





0 


control 


10th day 17thday 90th day 


Fig 2. Electroneurography (ENoG) values obtained on 
10th, 17th, and 90th days after inoculation (8 animals) 
and from control group (6 animals). Significant differ- 
ences in ENoG values were seen between control group 
and 10-day group (*; p < .05) and 17-day group (7; p< 
.05). There was no significant (NS) difference between 
ENoG values of 10-day group and 17-day group, or be- 
tween control group and 90-day group. 
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The facial nerves were dissected bilaterally, osmi- 
cated, dehydrated through an ethanol series, and em- 
bedded in Epon 812. The specimens were cut into 
0.5-mm-thick sections, counterstained with toluidine 
blue, and observed under a light microscope (Nikon 
Optiphot). The number of intact nerve fibers on each 
side was counted. The results were evaluated statis- 
tically with the Mann-Whitney test. 


number of intact 
nerve fibers 


2000 











0 
normal paralyzed paralyzed 
side side side 
(10th day) (17th day) 


Fig 4. Numbers of intact myelinated nerve fibers on para- 
lyzed and normal sides. Significant differences in num- 
ber of intact nerve fibers in paralyzed and normal facial 
nerves were seen on both 10th ($; p< .01) and 17th (§; p 
< .01) days. No significant (NS) difference was seen be- 
tween 10th and 17th days on paralyzed side. 


Fig 3. Bilateral extratemporal facial 
nerves of mouse with facial nerve pa- 
ralysis on 10th day. R — paralyzed side; 
L — normal side. On paralyzed side, 
degenerated nerve fibers are scattered 
among intact nerve fibers. 


The geniculate portions of the paralyzed facial 
nerve were cut into 700-A-thick sections and ob- 
served with a transmission electron microscope (Hi- 
tachi H800) on the 10th and 17th days after inocula- 
tion, in 3 animals on each occasion. 


RESULTS 

Figure | shows a series of blink reflexes elicited 
in a representative animal with transient facial nerve 
paralysis. On the 10th day, the R1 latency was pro- 
longed on the paralyzed side, coincident with the fa- 
cial nerve paralysis. By the 17th day, it had recov- 
ered to within the normal range, in parallel with the 
recovery of facial movement. The average R1 laten- 
cies in 12 animals are summarized in the Table. There 
was a significant difference in the RI latency be- 
tween the paralyzed and normal sides on the 10th day 
after inoculation (p < .05). On the other hand, no 
significant difference in the latencies of R1 was seen 
on the 17th day. 


The time course of the ENoG value is shown in 
Fig 2. In the control animals, the ENoG value aver- 
aged 98%. In the paralyzed animals sacrificed on the 
10th day, it was reduced and averaged 66%. There 
was a significant difference between the ENoG val- 
ues for the control and paralyzed animals on the 10th 
day (p < .05). Although voluntary facial movement 
had already returned to normal, the ENoG value re- 
mained significantly lower on the 17th day, averag- 
ing 74%. The ENoG recovery was delayed as com- 
pared to the remission of the facial nerve paralysis. 
It reached the normal range on the 90th day, with an 
average of 92%. There was no significant difference 
between the ENoG values for the groups sacrificed 
on the 10th and 17th days. 


The histologic specimens in Fig 3 show bilateral 
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Fig 5. Electron microscopic findings of 
geniculate portion of paralyzed facial 
nerves on A) 10th day and B) 17th day. 
On 10th day, some fibers were demyeli- 
nated with vacuolar changes in Schwann 
cells (arrows); in others, there was ax- 
onal degeneration with destruction of 
mitochondria in axon. On 17th day, most 
demyelinated nerve fibers had recov- 
ered, although some degenerate fibers 
were still present. 


extratemporal facial nerves obtained on the 10th day 
from an animal with facial nerve paralysis. On the 
paralyzed side, demyelinated and degenerated nerve 
fibers are scattered among intact nerve fibers. The 
numbers of intact nerve fibers on the 10th and 17th 
days are shown in Fig 4. There were significant de- 
creases in the number of intact nerve fibers in the 
paralyzed facial nerves on both the 10th and 17th 
days (p < .01). No significant difference was seen 
between the 10th and 17th days on the paralyzed side, 
although the facial nerve paralysis had completely 
recovered by the 17th day. 


Electron microscopic findings of the geniculate 
portion of paralyzed facial nerves on the 10th and 
17th days are shown in Fig 5. A mixed lesion con- 
sisting of intact, demyelinated, and degenerated fi- 





bers was seen in the facial motor nerve of the genicu- 
late portion on the 10th day. Some fibers were de- 
myelinated, with vacuolar changes in Schwann cells; 
in others, there was axonal degeneration with destruc- 
tion of the mitochondria in the axon. Most of the 
demyelinated nerve fibers had recovered by the 17th 
day, although some degenerate fibers were still pres- 
ent. 
DISCUSSION 

The blink reflex and ENoG are widely used for 
prognostic assessment of peripheral facial nerve pa- 
ralysis, particularly in Bell’s palsy.8-!4 Although both 
tests are clinically useful for predicting the progno- 
sis of the facial nerve paralysis, discrepancies be- 
tween the two tests and the degree of facial move- 
ment have been reported.!>-!9 For example, ENoG 
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10th day after inoculation 


axonal degeneration 






demyelination 


17th day after inoculation 
axonal degeneration 


remyelination 


Fig 6. Schema of pathological mechanism of facial nerve 
underlying this animal model. During facial nerve paral- 
ysis, on 10th day after inoculation, facial nerve pathologi- 
cal findings consist of mixed nerve damage, demyelina- 
tion, and axonotmesis. Facial nerve recovers from pa- 
ralysis by 17th day after inoculation, as soon as demyeli- 
nation is repaired and conduction block is released, al- 
though axons that underwent wallerian degeneration are 
still impaired. 


activity remains abnormally low in the late stage of 
facial palsy, even when facial movement has returned 
to normal.!8 In contrast, the blink reflex can be pro- 
longed or even disappear®-!° while the ENoG re- 
sponse remains quite normal. Qiu et al!> found a dif- 
ference in the time course of the ENoG value and 
the recovery from facial nerve paralysis in 32 pa- 
tients with Bell’s palsy. The ENoG value slowly de- 
creased in the early stage of the disease and then re- 
mained abnormally low, even after facial movement 
had returned to normal. The difference was more pro- 
nounced in patients with severe paralysis. Similar 
findings were reported by Thomander and Stalberg!” 
and Ardic et al,!° and are also seen in our animal 
model. 


Seddon” classified the degrees of peripheral nerve 


injury as neurapraxia, axonotmesis, and neurotmesis. 
Neurapraxia consists of myelin injury that blocks 
nerve conduction without developing wallerian de- 
generation. Axonotmesis is an axonal injury in which 
wallerian degeneration extends to the periphery. Neu- 
rotmesis is disruption of nerve fibers, mainly as a 
result of trauma. The degree of facial nerve injury 
can be clinically differentiated by electrical tests, in- 
cluding the blink reflex test, nerve excitability test, 
and ENoG. Neurapraxia involves a delay in R1 la- 
tency or an absence of R1 in blink reflex tests, with 
no reduction of the ENoG value. In axonotmesis or 
neurotmesis, R1 is absent in blink reflex tests and 
there is marked reduction of the ENoG value. In mod- 
els of traumatic nerve injury, the ENoG value is pre- 
sumed to be proportional to the number of surviving 
motoneurons?!.?2; however, some investigations have 
shown that patients with as much as 50% degenera- 
tion, as indicated by ENoG, may still have normal 
facial function.?3.24 Thus, normal facial function can 
be maintained even when considerable numbers of 
nerve fibers undergo degeneration. 


In the animal model, the R1 latency of the blink 
reflex test was prolonged and the ENoG value was 
reduced, in the acute stage of facial nerve paralysis 
seen on the 10th day after virus inoculation. Histo- 
logic examination of the geniculate portion of the 
facial nerve showed that within a mixture of intact 
nerve fibers, some fibers had undergone demyelina- 
tion and others had degenerated. Therefore, we think 
that both conduction block and nerve degeneration 
occur in the acute stage of facial nerve paralysis in 
our animal model. By the 17th day, when facial 
movement had recovered, demyelination had already 
recovered, although some degenerate fibers re- 
mained. Although the R1 latency of the blink reflex 
test had returned to normal, the ENoG value was still 
reduced on the 17th day; it recovered slowly there- 
after until the 90th day after virus inoculation. In 
ENoG, the compound action potentials are the aggre- 
gate myogenic responses evoked by the facial nerve 
trunks, and therefore, the amplitude of the compound 
action potentials is mainly dependent on the synchro- 
nous depolarization of motor axons. The reduced 
ENOG value on the 17th day suggests that in the re- 
covery phase, the synchronous depolarization was 
disturbed, because the conduction velocities differed 
between myelinated, degenerated, and regenerating 
fibers. These findings indicate that normal facial func- 
tion was restored by the 17th day by remyelinated 
nerve fibers. It is well known that demyelination of 
a nerve fiber is usually segmental and that recovery 
is rapid, whereas recovery from axonotmesis is slow 
because of wallerian degeneration. 


The time course of the facial nerve paralysis and 
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the results of the electrophysiological tests in our ani- 
mal model suggest that the underlying pathophysio- 
logical process of facial nerve injury in patients with 
Bell’s palsy involves a mixture of intact, demyeli- 
nated, and degenerated nerve fibers (Fig 6). The de- 
gree of facial nerve paralysis is related not only to 
nerve degeneration (axonotmesis), but also to con- 
duction block (neurapraxia). In a mixed lesion in 
which a considerable number of nerve fibers do not 
degenerate, facial movement recovers as soon as the 
conduction block is released, even though some fi- 
bers are still degenerated. 


Clinically, there is a strong correlation between 
the presence of the blink reflex in the early stage of 
Bell’s palsy and final recovery.? The R1 response of 
the blink reflex test is considered a direct and sensi- 
tive indicator of intratemporal facial nerve dysfunc- 
tion, since the electrical stimuli of the supraorbital 
nerve ending pass through the trigeminal nerve, the 
facial nucleus, and the site of the lesion in the intra- 
temporal facial nerve and then excite the orbicularis 
oris muscle. According to Ghonim and Gavilan,!° the 
absence of a blink reflex is significantly correlated 
with the severity of the nerve lesion: 78.3% of pa- 
tients without a blink reflex had an ENoG value be- 
low 25%. In our experimental study, the R1 latency 
of the blink reflex test paralleled the recovery of fa- 
cial movement, whereas ENoG recovery tended to 
be delayed. Therefore, with a mixed nerve injury, 





the R1 response of the blink reflex test is a good 
indicator for evaluating the degree of conduction 
block due to demyelination. When profound nerve 
injury develops, the R1 response disappears. The 
ENoG value is a good indicator for evaluating nerve 
degeneration. Combining the blink reflex with ENoG 
is useful for assessing a mixed lesion of intact, de- 
myelinated, and degenerated fibers, and is recom- 
mended for earlier, more accurate prediction of the 
prognosis of Bell’s palsy. 


CONCLUSION 


The pathophysiological basis of Bell’s palsy was 
explored in an HSV-1 infection—induced animal mod- 
el. Amixed lesion of intact, demyelinated, and degen- 
erated nerve fibers in the nerve trunk is the most prob- 
able pathological cause of Bell’s palsy. The functional 
gap between the facial nerve paralysis and the ENoG 
value in the late stage of Bell’s palsy is also attrib- 
uted to a mixed nerve injury. However, Bell’s palsy 
is thought to be caused by reactivation of latent HSV- 
1 infection in the geniculate ganglion. The mecha- 
nism of facial nerve paralysis in the primary infec- 
tion may not be the same as that in the reactivated 
infection. Therefore, further studies that involve pro- 
ducing animal models of facial nerve paralysis in- 
duced by reactivation of HSV-1 and elucidating the 
forms of nerve damage are under way in our labora- 


tory. 
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ACOUSTIC SIGNATURE OF THE NORMAL SWALLOW: 
CHARACTERIZATION BY AGE, GENDER, AND BOLUS VOLUME 
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Despite growing clinical use, cervical auscultation suffers from a lack of research-based data. One of the strongest criticisms of 
cervical auscultation is that there has been little research to demonstrate how dysphagic swallowing sounds are different from normal 
swallowing sounds. In order to answer this question, however, one first needs to document the acoustic characteristics of “normal,” 
nondysphagic swallowing sounds. This article provides the first normative database of normal swallowing sounds for the adult 
population. The current investigation documents the acoustic characteristics of normal swallowing sounds for individuals from 18 to 
more than 60 years of age over a range of thin liquid volumes. Previous research has shown the normal swallow to be a dynamic 
event. The normal swallow is sensitive to aging of the oropharyngeal system, and also to the volume of bolus swallowed. The current 
investigation found that the acoustic signals generated during swallowing were sensitive to an individual’s age and to the volume of 
the bolus swallowed. There were also some gender-specific differences in the acoustic profile of the swallowing sound. It is antici- 
pated that the results will provide a catalyst for further research into cervical auscultation. 


KEY WORDS — acoustic analysis, deglutition, swallowing sounds. 


INTRODUCTION 


Cervical auscultation (CA) is a noninvasive assess- 
ment of swallowing. Based on an analysis of the 
sounds of swallowing, it is an assessment that has 
researchers and clinicians alike pondering the mean- 
ing of the information that it provides. As a screen- 
ing procedure, CA could potentially determine the 
presence of dysphagia and the likelihood of aspira- 
tion. It involves the placement of a stethoscope or an 
acoustic detector unit (eg, accelerometer or micro- 
phone) onto the cervical skin in the region of the 
larynx. The acoustic signals are 1) perceptually scru- 
tinized by an examiner, if a stethoscope is used, or 
2) recorded onto audiotape, videotape, or a computer 
with an acoustic detector unit for later analysis. Cer- 
vical auscultation offers a bridge between the clini- 
cal examination of dysphagia and instrumental as- 
sessments of dysphagia. It offers a method of screen- 
ing the pharyngeal phase of the swallow, which is 
poorly achieved with a clinical examination.!3 In ad- 
dition, CA is more readily available than instrumen- 
tal assessments of dysphagia such as. videofluoros- 
copy, pharyngeal manometry, and fiberoptic endo- 
scopic evaluation of swallowing (FEES). 


Despite its. growing clinical use, CA is not seen as 
a “mainstream” dysphagia assessment tool. Docu- 
mentation of its credibility is required before it will 
be accepted on the same basis as videofluoroscopy, 
pharyngeal manometry, electromyography, and 
FEES. Logemann‘ finds 2 problems with the accep- 
tance of CA: 1) the source of generation of the sounds 





of swallowing has not been adequately identified, 
and 2) there have been no studies to determine wheth- 
er normal swallowing sounds are different from the 
swallowing sounds of dysphagic individuals. 


Previous researchers documented some of the 
acoustic characteristics of the normal swallow. Inter- 
est in the acoustic signature of the normal swallow 
first emerged circa 1965.5-7 Investigators®!° used 
spectrograms to characterize the key acoustic vari- 
ables of signal duration, intensity, and frequency. In 
reviewing the literature, it can be established that the 
average normal swallowing sound has 1) a duration 
of 0.25 to 0.8 second, 2) an intensity of between 4 
and 41 dB, and 3) a frequency range of 0 to 8 kHz, 
with a predominance of acoustic energy between 0 
and 3 kHz. However, the studies are hampered by 
small subject numbers and significant variation in 
research methodology (Table 15-10). 


The normal oropharyngeal swallow is a dynamic 
event. Research into the timing of the physiologic 
events of the normal swallow reveal that it changes 
as a function of age, gender, bolus volume, and bo- 
lus viscosity. If these variables have an appreciable 
effect on the physiologic timing of events of the nor- 
mal swallow, perhaps these variables also alter the 
acoustic signature of the swallow. 


Physiologically, the normal oropharyngeal swal- 
low is shown to slow down with advancing age. Indi- 
viduals over the age of 65 years swallow more slow- 
ly than do people under the age of 45 years.!!}!2 Both 
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TABLE 1. SUMMARY OF INVESTIGATIONS INTO CERVICAL AUSCULTATION 











Method Bolus Type 





Hearing aid microphone attached 
Handheld microphone on skin of 
Microphone placed on neck lateral 


Accelerometer on lateral border of 
trachea, immediately inferior to 





Dry swallow and 1 cup of coffee 
5-30 mL of water 
Dry swallows and swallows of 


tap water 
5 mL of water 


Authors Subjects 
Lear et al5 50 , 

loosely around neck 
Logan et alô 7 women, 9 men (mean, 32 y) 

left carotid triangle 
Mackowiak et al? Adult, male and female 

to midline 
Takahashi et al8 5 men (mean, 35 y), 5 women 

(mean, 29 y) 

cricoid cartilage 

Vice et al? 6 healthy newborn infants 


Accelerometer adjacent to lateral 


One feeding session 


aspect of larynx 


Hamlet et al! 13 with laryngectomy, 17 


controls (36-68 y) 


Accelerometer 2 cm anterior to 
and below angle of mandible 


10 mL of liquid barium, one third 
teaspoon of barium paste 








the oral and pharyngeal phases of the swallow have 
been shown to be affected by advancing age. !2-15 Ag- 
ing appears to encourage uncoupling of the oral and 
pharyngeal phases of the swallow.!2 Lebel et al!6 
found that the duration of the swallowing sound in- 
creases with advancing age. With delays between the 
oral and pharyngeal phases of swallowing, there may 
be acoustic evidence of a delay in the commence- 
ment of the swallowing sound in the elderly, as com- 
pared with the swallowing sounds of younger indi- 
viduals. 


The issue of gender-related differences in swal- 
lowing has not routinely been investigated. Robbins 
et al!! found that male and female swallows were 
similar in terms of duration and pressure character- 
istics. Only one significant difference was reported 
in this study. Women were found to have a longer 
duration of upper esophageal sphincter opening than 
do men. The authors attributed this difference to the 
anatomically smaller female pharynx and upper 
esophageal sphincter, postulating that the upper 
esophageal sphincter may need to be open longer to 
accommodate a bolus of equivalent size. Nilsson et 
al!7 noted that men ingested a bolus on average 30% 
larger than the average bolus of their female coun- 
terparts. Differences between male and female swal- 
lowing patterns have been largely attributed to ana- 
tomic differences in the size of the oropharyngeal 
mechanism. These anatomic differences may also af- 
fect key acoustic characteristics of the swallowing 
sounds of men and women. 


Age and gender are not the only variables known 
to affect the physiology of the swallow. Bolus vol- 
ume critically affects the timing of the physiologic 
events of the normal swallow.!8!9 Dantas et al2° found 
that an increase in bolus volume caused changes in 
both the oral and pharyngeal phases of swallowing. 
Lazarus et al!9 noted that with increased bolus vol- 
ume, there is an increase in laryngeal closure time. 
To date, there are no CA studies that have investi- 


gated the effect of bolus volume on the acoustic sig- 
nature of the normal swallow. Given the physiologic 
changes that occur during swallowing with alterations 
in bolus volume, it does not seem unreasonable that 
there may also be volume-related changes to the 
acoustic profile of the swallow. 


The CA literature currently lacks a substantial age- 
specific normative database. While there is little in- 
formation pertaining to dysphagic swallowing 
sounds, one of the first priorities must be to provide 
a substantial normative database with which to com- 
pare abnormal swallowing sounds. 


The purpose of this study was to use objective 
acoustic techniques to capture and analyze normal 
adult swallowing sounds over an age range spanning 
18 years to over 60 years of age in order to identify 
changes in the acoustic signature of the swallow that 
could be attributed to age, gender, or bolus volume. 


METHOD 


Subjects. We recruited 62 nondysphagic volunteers 
to participate in this investigation. Three subjects 
were excluded from the study because of 1) inability 
to adequately adhere the microphone to the cervical 
region because of excessive sebaceous secretions, 2) 
hair growth on the cervical region that caused inter- 
ference with the recording of the swallowing sound, 
or 3) inadequate acoustic recording of the signal. 
Consequently, 59 subjects participated in the study. 
The individuals were organized into 1 of 3 groups 
depending on their age. Group 1 (18 to 35 years) had 
10 men (mean [+SD], 25.2 + 3.26 years) and 10 
women (mean, 28.0 + 3.97 years). Group 2 (36 to 59 
years) had 10 men (mean, 46.8 + 5.16 years) and 10 
women (mean, 44.7 + 6.83 years). Group 3 (60+ 
years) had 10 men (mean, 67.5 + 4.25 years) and 9 
women (mean, 67.3 + 4.47 years). 


Each subject satisfied the following criteria: 1) no 
history of swallowing disorders, 2) no major surgery 
of the head and neck that would affect swallowing; 
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Fig 1. A) Raw acoustic swallowing signal and narrow- 
band spectrogram. B) Schematic diagram of swallowing 
sound. 


3) no gross structural abnormalities of the speech pro- 
duction apparatus; 4) skin on the cervical region that 
was judged to be competent (ie, not loose, stiff, or 
scaling; skin that is loose, as in obesity, stiff, as can 
be the case after radiotherapy, or scaling skin in which 
the skin is dry and flaking off are conditions that can 
cause distortions to the acoustic signal); 5) no his- 
tory of stroke, head injury, head or neck surgery, or 
progressive neurologic disease; and 6) ability to speak 
English. 


Procedure. After an explanation of the procedure, 
the subject was seated in a straight-backed chair. The 
cervical region was cleansed with an alcohol wipe. 
A microphone (EK3132 Knowles electret) was se- 
cured to the cervical region at the midline of the cri- 
coid cartilage with single-sided surgical tape such 
that there was no air escape around the microphone. 
The microphone was attached to a preamplifier (pre- 
champ K-5608) that fed the acoustic signal directly 
into the Computerized Speech Laboratory (CSL- 
4300, Kay Elemetrics). The subjects were given tri- 
als of 10-mL water swallows to allow the investiga- 
tor to set the recording for the optimal level for each 


subject. Once the optimal gains had been established 
for each individual, the controls remained fixed for 
the experimental part of the procedure. The subjects 
were requested to swallow the bolus in one complete 
action. Recording of the swallow commenced upon 
lip-to-cup contact. Recording ceased after the swal- 
low, when the larynx appeared to return to the rest- 
ing position and the individual brought the cup back 
to the resting position. The samples were recorded 
directly onto disk. A sampling rate of 20,000 Hz was 
used in order to avoid violating the Nyquist fre- 
quency, and allowing signal analysis in the range of 


“O to 8,000 Hz. 


We mixed 40 mL of Cottees Diet Fruit Cup cor- 
dial with 500 mL of water. Once the CSL settings 
had been customized for each individual, the follow- 
ing were recorded: 1) bolus swallow of 5 mL of room- 
temperature cordial, 2) 10 mL of room-temperature 
cordial, and 3) 15 mL of room-temperature cordial. 
Two samples were recorded for each volume. The 
calibrated amounts of fluid were dispensed into a 
standard 250-mL-capacity plastic cup. An order ef- 
fect was not anticipated; therefore, each subject swal- 
lowed the boluses in the order detailed above. 


Objective Acoustic Analysis. Each of the 2 swal- 
lows recorded for each subject at each volume was 
observed by listening to the signals and visualizing 
the raw acoustic waveform and its resulting narrow- 
band spectrogram. The best swallow of each pair was 
selected for detailed acoustic analysis. The “best” 
swallow was judged to be the one that was acousti- 
cally “cleanest,” and usually reflected the swallow 
with the higher signal-to-noise ratio (SNR). The best 
signal was judged from a combination of 1) visual 
inspection of the raw acoustic signal and the linked 
spectrogram and 2) perceptual analysis of the signal. 
Therefore, 1 swallow per volume, totaling 3 swal- 
lows per subject, was profiled acoustically. 


Each swallow was displayed as a raw acoustic sig- 
nal and linked by cursor with its narrowband spec- 
trogram (1,024 point, 28-Hz bandwidth, no preem- 
phasis, hamming window applied). The beginning 
of the swallowing sound was determined by a com- 
bination of auditory playback and visual inspection 
of the spectrogram. The cursor was then walked back 
0.8 second from the beginning of the swallowing 
sound. This was marked as “time zero” of the acous- 
tic signal. Therefore, each acoustic signal contained 
a period of 0.8 second prior to the beginning of the 
swallowing sound, then the swallowing sound, then 
post-swallow acoustic recordings. The signals were 
marked in this way to ensure a consistent starting 
point for each acoustic signal. Termination of the 
swallowing signal was determined from a combina- 
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TABLE 2. RELIABILITY DATA IN TERMS OF ORIGINAL 
AND REANALYZED SCORES FROM TEN PERCENT 





OF SUBJECTS 
Original Reanalyzed 
Data Data 
Variable (mean +SD) (mean +SD) t p 
Swallow sound 0.40 + 0.42+ -0.918 .366 
duration (s) 0.10 0.12 
Swallow sound 0.95 + 0.99 + -2.020 .053 
peak (s) 0.14 0.13 
Swallow intensity 43.97 + 44.69 + 0.693 .494 
(dB) 6.08 6.23 
Swallow frequency 2,393.84 2,540.27+ -1.407 .170 
range (Hz) 763.49 901.45 
Swallow region of 479.53 + 481.63+ -0.095 .925 
spectral intensity 225.34 256.72 
(Hz) 











tion of 1) cursor-linked displays of the raw acoustic 
signal and its spectrogram and 2) perceptual assess- 
ment. The swallowing sound was complete when the 
spectrogram trace showed that the signal had returned 
to the baseline (Fig 1). The peak of the swallowing 
signal was determined from the energy profile, show- 
ing the loudest point of the swallowing sound in deci- 
bels. 


Acoustic signals are 3-dimensional and can be de- 
scribed in terms of duration (seconds), frequency 
(hertz), and amplitude (decibels). Durational mea- 
surements, recorded in seconds or milliseconds, pro- 
vide information about the length of the acoustic sig- 
nal. Frequency refers to the number of complete cy- 
cles of movements in 1 second.?! It is measured in 
hertz, so that 1 Hz equals 1 cycle per second. Hu- 
man perception of frequency is directly related to 
the pitch (high versus low) of the sound. Amplitude 
refers to the strength of the sound. In its strictest 
sense, amplitude refers to the physical aspects of the 
signal, whereas loudness relates to our perception of 
it. Fry?! states that “loudness depends on the energy 
in the sound, which is proportional to both ampli- 
tude and frequency of the signal.”21090) The term for 
this is sound intensity, and it is measured in deci- 
bels. 


Given that the acoustic signal is 3-dimensional, 
the swallowing sound was measured for characteris- 
tics of duration, frequency, and intensity. In brief, 
the length of the sound was measured (seconds), the 
“pitch” or frequency range of the sound was mea- 
sured (hertz), and the loudness of the sound was mea- 
sured (decibels). More specifically, the following 
acoustic measurements were recorded. 


. The length of the swallowing sound (seconds) was 
recorded and describes the duration from the begin- 
ning of the swallowing sound to the end of the swal- 
lowing sound. The time at which the swallowing 


sound reached its peak intensity was also recorded. 


The intensity (decibels) or loudness of the swal- 
lowing sound was determined from an energy spec- 
trum (5-ms frame length, no smoothing). The SNR 
was calculated by subtracting the minimum ampli- 
tude level of the acoustic signal from the peak am- 
plitude level of the swallowing signal. The higher 
the value of the resulting figure, the “louder” the sig- 
nal was deemed to be, as compared with the inher- 
ent noise of the signal. As an analogy, a good SNR 
can be likened to tuning a radio station properly. If 
the radio station is not well tuned, there will be static 
mixed with the signal (ie, low SNR). If the radio sta- 
tion is well tuned, there is no static, and the signal is 
crisp and clear (ie, high SNR). Acoustic recordings 
are rarely good enough to totally exclude background 
noise; hence, a high SNR is desirable. The peak in- 
tensity of the swallowing sound was recorded, 


The pitch or frequency (hertz) of the swallowing 
sound was also recorded. Two measures were taken 
for this parameter. Measurements were obtained from 
a narrowband spectrogram. The frequency range 
(hertz) of the swallowing sound was recorded. This 
gave information as to the highest “pitch” of the swal- 
lowing sound. In addition, a frequency reading was 
also taken at the loudest part of the swallowing sig- 
nal. It was referred to as the “region of spectral in- 
tensity” and can be seen schematically in Fig 1. 


Figure 1 shows an example of a raw acoustic swal- 
lowing signal and its narrowband spectrogram. A 
schematic diagram indicating the measurements 
taken from the acoustic signal is also included. 


A single investigator carried out the testing and 
analysis. The results from 6 subjects were randomly 
chosen for reanalysis. One male and 1 female indi- 
vidual from each age group were randomly selected, 
representing approximately 10% of the data. Paired 
t-tests were used to ascertain any differences between 
the original scores and the reanalyzed scores for the 
objective measures. The results showed there to be 
no significant differences between the original scores 
and the reanalyzed scores (Table 2). There was 100% 
agreement for the perceptual data. 


Perceptual Acoustic Assessment. In addition to the 
objective assessments of duration, intensity, and fre- 
quency outlined above, subjective judgment of the 
acoustic signal was also recorded. Swallowing sounds 
were Classified as either “distinct” or “other.” Dis- 
tinct sounds were crisp and clearly distinguished from 
the rest of the acoustic signal. 


RESULTS 


Perceptual Analysis. The swallowing sounds were 


yi 
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TABLE 3. RESULTS OF PERCEPTUAL ACOUSTIC ASSESSMENT FOR THIN FLUID BOLUS SWALLOWS OF 
DIFFERENT SIZES 
18-35 y 36-59 y i 260 y 
Bolus Size Women (n=10) Men(n=10) Women(n=10) Men(n=10) Women(n=9) Men(n=10) 
5 mL ; 
Distinct 90 100 90 90 80 100 
Other 0 0 0 0 0 j 20 
10 mL 
Distinct 80 100 90 90 71 100 
Other 0 0 10 0 0 0 
15 mL 
Distinct 100 90 100 100 100 100 
Other 0 0 0 0 20 10 


Data are percentages of subjects displaying each type of swallow. “Distinct” sounds were crisp and clearly distinguished from rest of acoustic 


signal. 





distinguishable at all times. The swallowing sounds 
were distinct more than 70% of the time for all vol- 
umes and for all groups. They were distinct 90% of 
the time or more for the 15-mL bolus swallows. Re- 
viewing all volumes and all groups, we saw that there 
were 7 occasions on which an “other” swallow was 
coded. Of these, 5 were multiple swallows of 1 bo- 
lus, while the remaining 2 were short, shrill, high- 
pitched—sounding swallows that were perceptually 
different from the other swallows in the sample. The 
15-mL bolus generated the highest number of mul- 
tiple swallows. The shrill, high-pitched swallows oc- 
curred only in men over the age of 60, at either the 
5-mL volume or the 15-mL volume. There were 4 
subjects (3 men, 1 woman) for whom the swallow- 
ing sound was “distinct” but not as crisp as for the 
remainder of the population. The 3 men were all of 
thin build and had large, prominent laryngeal carti- 
lages. The woman had a “thick neck,” although she 
would not have been classified as obese. These skel- 
etal factors may have contributed to the perceptual 
quality of the swallowing sounds in these subjects. 
Table 3 shows the perceptual results for the 5-, 10-, 
and 15-mL swallows. 


Objective Acoustic Analysis. There were 2 dura- 


_ tional variables (seconds), 1 intensity variable (deci- 


bels), and 2 frequency variables (hertz) analyzed. The 
subject numbers were sufficient for us to use para- 
metric statistical applications. It is an underlying as- 
sumption of parametric statistical analysis 1) that the 
data are normally distributed and 2) that there is ho- 
mogeneity of variance. In terms of normal distribu- 
tion, the duration data were found to be positively 
skewed, while the frequency data were negatively 
skewed. The intensity data were shown to approach 
the normal distribution. Glass et al?” have noted that 
in the absence of heterogeneity of variance, skewed 
populations have very little effect on either the level 
of significance or the power of the test. Levene’s test 


for homogeneity of variance was therefore applied 
to the data. The duration and intensity data did not 
violate the assumption of homogeneity of variance. 
The frequency data, however, violated the assump- 
tion of homogeneity of variance and were, therefore, 
submitted to naperian log transformations. Although 
this transformation improved the normality of the dis- 
tribution and the homogeneity of variance, the data 
became uninterpretable. It was decided that it would 
be better to analyze the original skewed data in or- 
der to be able to interpret the results. Under these 
circumstances, Tabachnick and Fidell23 advocate that 
it is acceptable to run the parametric assessment, but 
use a more stringent o level. Consequently, for the 
duration and intensity data, an œ level of 0.05 was 
deemed acceptable, while the frequency data were 
set to the more stringent o level of 0.025 to reach 
statistical significance. A repeated-measures multi- 
variate analysis of variance was applied to the dura- 
tion and frequency data, while a repeated-measures 
analysis of variance was applied to the intensity data, 
as there was only 1 variable under investigation for 
this parameter. Pillai’s trace was used to flag signifi- 
cant effects. The design of the experiment was such 
that the between-subjects parameters were age and 
gender. The within-subjects factor was the volume 
of the bolus being swallowed. 


The multivariate analysis of variance results for 
duration indicated that there was a main effect for 
volume (F16,37 = 6.079, p < .001). A volume-by-age 
interaction (F32,76 = 1.646, p = .040) and a volume- 
by-gender interaction (F16,37 = 1.936, p = .049) were 
also noted. Follow-up univariate F tests were applied 
to the data. The Newman-Keuls procedure was ap- 
plied for post hoc analysis of significant F tests.24 
There were no main effects and no interaction ef- 
fects noted for the intensity data for any of the vari- 
ables under investigation (age, gender, and bolus vol- 
ume). As there were no main effects and no interac- 
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o 
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18-35 y 36-59 y 260 y 18-35 y 36-59 y 260 y 
5§mL 10 mL 


tion effects, the intensity data were collapsed across 
age, gender, and volume. The swallowing sounds had 
a mean SNR of 43.25 dB (SD, 1.73 dB). There were 
no main effects for the frequency data. There was, 
however, a volume-by-gender interaction (F16,28 = 
2.42, p = .029). Friedman’s test was applied to the 
frequency data for univariate analysis, and appropri- 
ate nonparametric post hoc analyses were applied 
where required. SPSS for Windows (version 7.5) was 
used to complete the statistical analysis (SPSS, 1997). 


The acoustic signature of the normal swallow as 
characterized by age, gender, and bolus volume is 
provided below. Of the 3 dimensions of the acoustic 
signal (duration, intensity, and frequency), duration 
was shown to be the most sensitive feature. Smaller 
variations were noted for the frequency domain; how- 
ever, no variations were found with respect to inten- 
sity. The results for duration, frequency, and inten- 
sity are expanded upon below. 


Of the 2 durational features assessed, the “loca- 
tion of peak intensity of the swallowing sound” (Fig 
1) was insensitive to variations in age, gender, and 
bolus volume. The peak intensity of the swallowing 
sound occurred, on average, 0.993 second (SD, 0.038 
second) after the beginning of the acoustic signal. 
Given that the swallowing sound was stabilized to 
begin 0.8 second from the beginning of the acoustic 
signal, the location of the swallow peak occurred 
0.193 second into the swallowing sound. 


The duration of the swallowing sound was sensi- 
tive to changes in volume (p = .001) and also showed 
evidence of a volume-by-age interaction (p < .0001). 
The duration of the swallowing sound was signifi- 
cantly shorter for the 15-mL volume than for either 
the 5- or 10-mL volumes. Post hoc analyses of the 
volume-by-age interaction yielded the following in- 
formation. For the 5-mL volume only, the group 1 


3000 







Fig 2. Volume-by-age interaction of swal- 
lowing sound duration. Light bars — means; 
dark bars — SDs. 





36-59 y 260 y 
15 mL 


swallowing sound duration was significantly shorter 
than the group 2 and group 3 durations. Age-specif- 
ic patterns also emerged. For the youngest group 
(group 1), the 10-mL scores were significantly longer 
than either the 5- or 15-mL scores. For group 2, the 
15-mL scores of swallowing sound duration were sig- 
nificantly shorter than either the 5- or 10-mL scores. 
For the oldest group (group 3), the 5-mL swallow- 
ing sound durations were significantly longer than 
the 10- or 15-mL scores. Figure 2 illustrates the vol- 
ume-by-age interaction described above. This Fig- 
ure also demonstrates that there was the greatest vari- 
ation between individuals of different age groups 
when they were swallowing a 5-mL bolus volume. 


The frequency range of the swallowing sound 
showed a volume-by-gender interaction (Fig 3). 
There were significant differences noted for men’s 
scores (p = .002), but not for women’s scores (p = 
.172). The women’s scores were therefore collapsed 
for this parameter to yield a swallowing sound fre- 
quency range of 2,297 Hz (SD, 246 Hz). Post hoc 
analyses of the men’s frequency range scores showed 
the 5-mL volume scores (2,635 Hz) to be signifi- 
cantly higher than their 10-mL (2,123 Hz) or 15-mL 
(2,088 Hz) scores. The region of spectral intensity 
was found to be stable across age, gender, and bolus 
volume. The results were therefore collapsed to pro- 
vide a value of 550.96 Hz. 


DISCUSSION 


Cervical auscultation is a screening assessment for 
dysphagia that shows great potential for use as an 
adjunct to the clinical examination. It is noninvasive 
and requires limited patient cooperation. It may po- 
tentially provide clinicians with additional informa- 
tion regarding swallowing status that enables them 
to better refer patients for diagnostic assessments and 
to monitor improvement or deterioration in swallow- 
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Fig 3. Volume-by-gender interaction of 
swallowing sound frequency range. Light 
bars — means; dark bars — SDs. 500 
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ing status. However, before the technique can be ap- 
plied to dysphagic individuals, clinicians first need 
a normative database with which to compare dys- 
phagic swallowing sounds. It is only in characteriz- 
ing “normal” that “abnormal” can be properly re- 
flected. This study showed that swallowing sounds 
are as dynamic as the act of swallowing itself. 


This discussion will detail the acoustic signature 
of the normal swallowing sound and will compare it 
with current CA research and the physiologic swal- 
lowing literature. The discussion will address each 
of the acoustic components of duration, intensity, and 
frequency of the swallowing sound. 


Duration, The authors of articles on CA are in 
agreement that the duration of the swallowing sound 
is between 0.25 and 0.8 second.>*:!0.16.25 The find- 
ings from the present investigation fall between these 
values at approximately 0.4 second. The wide range 
reported in the literature is explained by the fact that 
there is little consistency in what researchers clas- 
sify as the beginning and the end of the swallowing 
sound. For example, the raw signals and cursor-linked 
spectrograms shown in the article by Takahashi et 
al?5 show that they interpreted more of the signal as 
the “swallowing signal” than we did. 


The current study showed swallowing sound du- 
ration to vary as a function of bolus volume. Swal- 
lowing sound duration decreased as bolus volume 
increased. Boiron et al?° found that the duration of 
the swallowing sound increased with an increase in 
bolus volume. These results are at odds with the re- 
sults of the current study. Swallowing physiology 
provides an explanation for the difference in find- 
ings. Dantas et al?° show that the velocity of the bo- 
lus head in a small-volume swallow (15 cm/s) is con- 
siderably less than it is for a large-volume swallow 
(50 cm/s). If the velocity of the bolus head increases 
with an increase in bolus volume, then it makes sense 
that it will traverse the pharynx more quickly and 
produce a shorter swallowing sound. Such an expla- 
nation would support the results of the current study. 
The study of Boiron et al falls into the category of 
acceleration studies, using a chart recorder to record 
data. The measurements of Boiron et al were taken 
from a raw signal in isolation, and there is little ex- 
planation of how the durational measurements were 
determined. The current study used acoustic analy- 
sis, and used computer software to link the raw acous- 
tic signal to a spectrogram and concurrent audio play- 
back to determine the length of the swallowing sound. 
The current study provides more accurate informa- 
tion about the duration of the signal. 


In addition to the significant effect for volume, 


the present study also found a significant age-by-vol- 
ume interaction (5-mL volume only). The youngest 
group recorded a swallowing sound duration of 0.377 
second. This value was significantly shorter than that 
of the middle group (0.485 second) and that of the 
older group (0.524 second). There were no differ- 
ences between the middle and older groups. Lebel et 
al!6 have also shown that swallowing sound dura- 
tion increases with advancing age. Researchers of 
swallowing physiology have found that swallowing 
is considerably slower after the age of 65 years than 
under the age of 45 years.!!:!* A similar pattern of 
results was noted for the 15-mL volume, although 
the difference between the groups was not statisti- 
cally significant. These results are supportive of the 
physiologic swallowing literature, and show that CA 
is also sensitive to these age-related changes. An un- 
expected finding was that the swallowing sound du- 
rations for all 3 groups clustered around the same 
value for the 10-mL volume. Further studies are re- 
quired to see whether these results can be replicated. 


In addition to changes in the acoustic signal re- 
lated to age and bolus volume, previous CA studies 
have found a gender bias for swallowing sound du- 
ration. Lebel et al!® and Takahashi et alë noted that 
women have shorter swallowing sounds than do men. 
This finding was not replicated in the present study. 
Very little variability was noted between the two gen- 
ders for swallowing sound duration (5 mL: women 
0.46 second, men 0.45 second; 10 mL: women 0.46 
second, men 0.44 second; 15 mL: women 0.41 sec- 
ond, men 0.38 second). Perhaps methodological vari- 
ation may account for the differences found between 
the present study and previous CA studies. 


In addition to the duration of the swallowing 
sounds, there was one other durational variable in- 
vestigated: the “location of the swallow peak.” The 
location of the swallow peak refers to the time (in 
seconds) at which the swallowing sound reaches peak 
intensity. The current investigation found that the lo- 
cation of the swallowing peak occurred uniformly 
0.993 second after the beginning of the acoustic sig- 
nal and 0.193 second from the beginning of the swal- 
lowing sound. It was stable across ages, genders, and 
bolus volumes. 


It is possible that the “location of swallow sound 
peak intensity” may be an indicator of pharyngeal 
contraction activity. Physiologically, pharyngeal 
clearance is insensitive to volume, and it is mini- 
mally affected by age.!!.!2.27-29 We purport in another 
article* that at least part of the swallowing sound is 
caused by reverberations of the pharyngeal walls fol- 
lowing delivery of the bolus to the pharynx by the 
tongue. If this is the case, it is conceivable that the 
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location of the swallowing sound peak may be a 
marker of pharyngeal contraction and clearance. This 
may explain its insensitivity to factors such as bolus 
volume, age, and gender. 


Intensity. Swallowing sound intensity was found 
to be stable at 43 dB and insensitive to age, gender, 
or bolus volume. In effect, this means that the mecha- 
nism driving the bolus is strong and consistent across 
volumes, age groups, and gender. Given that pharyn- 
geal contraction also remains stable across ages, gen- 
ders, and bolus volumes, the finding of a stable swal- 
lowing sound intensity should not be surprising: Sta- 
bility of swallowing sound intensity appears to be 
supported by the physiologic swallowing literature. 
In the CA field, however, Lebel et al!§ noted that the 
swallowing sound becomes louder with advancing 
age. The results of the current study do not support 
the findings of Lebel et al. 


Lebel et allé also noted that there was a gender 
difference in the intensity of the swallowing sound. 
They indicated that women produced softer-inten- 
sity swallowing sounds than did men. No significant 
difference as a result of gender was found in the pres- 
ent study. Also, no significant gender-related differ- 
ences for intensity of the swallowing sound were 
found in the acoustic study of Takahashi et al.8 Meth- 
odological variations may explain the differences in 
results. 


Frequency. There is little consistency in the way 
in which the frequency components of the swallow- 
ing sound have been determined in previous CA in- 
vestigations. The reported frequency range of swal- 
lowing sounds is said to be up to 8 kHz, based on 
spectrographic diagrams.® Hamlet et al?! argued that 
a predominance of acoustic energy could be found 
between 0 and 3 kHz. The current study found the 
maximum frequency range of swallowing sounds to 
be 5,900 Hz, with a mean frequency range score of 
2,200 Hz. 


The current study found a volume-by-gender in- 
teraction for swallowing sound frequency range. Oth- 
er CA studies have not specifically investigated gen- 
der-related frequency differences in swallowing 
sounds. In the current study, the female frequency 
range scores were comparable across bolus volumes, 
showing a mean value of 2,297 Hz. The male fre- 
quency range volumes, however, differed depend- 
ing on the volume of the bolus swallowed. The male 
frequency range for the 5-mL volume was signifi- 
cantly higher than both the 10-mL and 15-mL val- 
ues. The gender bias may in some way be related to 
anatomic differences between males and females. As 
noted in the Introduction, men appear to be designed 
to swallow larger boluses, whereas women seem de- 


signed to swallow smaller boluses. These observa- 
tions stem from differences in preferred bolus size.32 
It is possible that men, rarely swallowing a bolus as 
small as 5 mL, manipulate it differently than their 
customary larger-volume boluses. Physiologically, 
the swallowing configuration of the oropharynx is 
achieved earlier and lasts longer for a large-volume 
bolus, as compared with a small-volume bolus.?8 Dif- 
ferences in anatomic configuration of the pharynx 
would affect the frequency response of the pharynx 
and the acoustic signal generated. The ratio of bolus 
volume to the size of the oropharynx may explain 
the differences recorded in the male frequency range 
of swallowing sounds. 


It is an interesting finding that the frequency range 
of the swallowing sound mirrors the F2 formant range 
of the vowel sound /i/ (as in heed). The formants in 
speech change depending on the anatomic configu- 
ration of the vocal tract. Formants, therefore, reflect 
the shape of the vocal tract during production of in- 
dividual sounds. “For the vowel sound /i/, the front 
of the tongue is high in the mouth so that the rear 
section of the tract is very long, while the part in 
front of the tongue contraction is very short.”21(@77) 
This configuration provides a frequency of 2,280 Hz 
for F2. If one particular vocal tract shape consistently 
produces a speech sound, then perhaps a specific oro- 
pharyngeal configuration could also produce a dis- 
tinctive swallowing sound. The current study found 
the average swallowing sound frequency range to be 
2,289 Hz. The configuration of the vocal tract as one 
says /i/ is not too dissimilar to the event just prior to 
swallowing as the tongue delivers the bolus to the 
pharynx. Hence, it would be no coincidence that the 
frequency range of the swallowing sound and the fre- 
quency component of /i/ are almost identical. These 
frequency results may provide physiologic support 
for the validity and sensitivity of the CA findings. 
Further research in this area is encouraged. - 


The frequency at which the greatest spectral inten- 
sity of the swallowing sound occurred was recorded. 
In other words, the frequency of the loudest portion 
of the swallowing sound was determined from the 
spectrographic results. The present study found an 
average region of spectral intensity of 550.96 Hz, 
stable across volume, gender, and age. Previous in- 
vestigators have determined this value from fast Fou- 
rier transforms and spectrograms. Hamlet et al re- 
corded an energy peak at 556 Hz and another at 1,384 
Hz, although the latter was “not necessarily the high- 
est-amplitude portion of the spectrum.”33@75 There 
was a 40-ms gap between the 2 peaks, and the val- 
ues were taken from fast Fourier transforms of the 
swallowing sound. Mackowiak et al” used spectro- 
grams and also found an initial low-frequency peak 
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of 400 to 600 Hz, then a larger peak at 1,000 Hz at 
the end of the swallow. The present study recorded 
only 1 region of spectral intensity. Because not ev- 
ery swallow demonstrated 2 regions of spectral in- 
tensity within the swallow, where 2 elements were 
noted, an average of the 2 points was recorded. The 
results from the current study clearly support those 
from previous CA investigations. 


SUMMARY 


The current investigation provides the first large 
age-specific database for normal swallowing sounds. 
This study supports, in part, previous CA investiga- 
tions conducted on smaller subject numbers and us- 
ing differing methodologies. This study finds CA to 
be both sensitive to the dynamics of swallowing and 
reflective of physiologic changes that occur with 
swallowing. The acoustic parameters of duration, 
intensity, and frequency were characterized. The av- 
erage swallowing sound is approximately 0.4 sec- 
ond in duration, with an intensity of 43 dB and a 
frequency range of 2,200 Hz. The peak intensity of 
the swallow occurs approximately 0.19 second into 
the swallowing sound and registers a frequency of 
approximately 550 Hz. The swallowing sound is, 
however, a dynamic entity. Its duration is altered as 
a function of age and bolus volume. There are also 


elements of the swallowing sound that are stable, such 
as intensity, and the location and frequency of the 
peak portion of the swallowing sound. In the field of 
dysphagia research, the most highly respected physi- 
ologic information comes from videofluoroscopic, 
manometric, electromyographic, and fiberoptic stud- 
ies. The results from the current study were shown 
to support the physiologic literature. It was also dem- 
onstrated that in the same way that we adopt charac- 
teristic vocal tract shapes during the production of 
speech, there may be a characteristic posturing of the 
oropharynx during swallowing. The frequency range 
data support this hypothesis. The results of the cur- 
rent investigation provide a point of reference for dys- 
phagic swallows. The duration, intensity, and frequen- 
cy data may help clinicians to distinguish healthy 
swallows from dysphagic swallows. 


Physiologically, swallowing characteristics are also 
noted to change when the swallowing mechanism 
breaks down. There are well-documented physiologic 
differences between healthy swallowing and dysphag- 
ic swallowing function. Further research is required 
to show whether dysphagic swallowing sounds are 
indeed different from normal swallowing sounds. Fu- 
ture investigations should examine the characteris- 
tics of dysphagic swallowing sounds. 
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EXPOSURE TO 200 PPM OF METHANOL INCREASES THE 
CONCENTRATIONS OF INTERLEUKIN-18 AND INTERLEUKIN-8 IN 
NASAL SECRETIONS OF HEALTHY VOLUNTEERS 
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This study was designed to investigate subclinical irritating effects of methanol on functional and immunologic parameters in 
human respiratory epithelia. Twelve healthy, nonsmoking individuals were exposed to concentrations of 20 and 200 ppm of methanol 
in an exposure chamber. The concentrations of interleukin (IL)-8, IL-1, IL-6, and prostaglandin E2 (PGE2) were monitored. The 
saccharin transport time test was used to evaluate mucociliary transport. Video interference contrast microscopy was used to deter- 
mine the ciliary beat frequency of nasal epithelial cells. Subjective symptoms were assessed with a questionnaire. The median 
concentrations of IL-8 and IL-18 were significantly elevated after exposure to 200 ppm of methanol as compared to exposure to 20 
ppm (IL-1, 21.4 versus 8.3 pg/mL, p = .001; IL-8, 424 versus 356 pg/mL, p = .02). The release of IL-6 and PGE2 did not change 
significantly (IL-6, 10.3 versus 6.5 pg/mL, p = .13; PGE2, 13.6 versus 13.4 pg/mL), nor did the ciliary beat frequency or the saccharin 
transport time. Both IL-8 and IL-1B proved to be sensitive indicators for subclinical irritating effects of methanol in vivo. The 
German threshold limit of 200 ppm of methanol does not prevent subclinical inflammatory reactions of the nasal respiratory mucosa. 


KEY WORDS — ciliary beat frequency, interleukin, methanol, mucociliary transport, nasal respiratory epithelium, prostaglan- 
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INTRODUCTION 


It has been shown in the past that different organic 
solvents can cause symptoms of nasal irritation, if 
individuals are exposed to sufficient concentrations. 
Scientific evaluations of these irritations and their 
causative mechanisms are difficult and usually are 
performed with the help of questionnaires with sub- 
jective symptom scales.! However, it seems neces- 
sary to assess objective parameters to define dose- 
dependent effects on a clinical, as well as a subclini- 
cal, level. 


Methanol is an organic solvent that is widely spread 
in our environment. In 1981, for example, the world- 
wide production was estimated at around 12 million 
tons. Most of it is used for the production of sub- 
stances like formaldehyde, but it is also used for 
glues, paints, cellulose, and resins. Methanol is an 
important solvent for urea, nitrocellulose, and other 
substances and is used as an extractor in technical 
laboratories. Occupational exposure to higher con- 
centrations of methanol is likely for individuals who 
work in chemical industries and in occupations re- 
lated to the production of glues, paints, solvents, etc.? 
Such occupational exposures are to various mixtures 
of substances including methanol, as opposed to 


methanol alone. However, for the sake of analyzing 
methanol-specific effects without interactions with 
other substances, individuals in this study were ex- 
posed solely to methanol. 


Proinflammatory cytokines, such as interleukin 
(IL)-8, IL-1, and IL-6, are known to be involved in 
nasal epithelial inflammatory reactions. Such inflam- 
matory responses have been observed after exposure 
of human respiratory mucosa to nitrogen dioxide 
(NO2), diesel exhaust particles, and ozone,‘ and they 
appear likely to follow exposures to organic solvents’ 
as well. Another important inflammatory mediator 
that can also be synthesized in human respiratory mu- 
cosa is prostaglandin (PG) E2.®? Short-term exposure 
to ozone has been shown to result in elevated PGE2 
concentrations in human bronchoalveolar fluid.’ The 
upper respiratory tract mucosa is the primary site of 
contact for viral, bacterial, and chemical agents that 
are inhaled with the environmental air. Mucociliary 
transport plays an important role in the nonimmuno- 
logic defense of the mucosa, as it represents a first 
physical barrier that absorbs, neutralizes, and elimi- 
nates inhaled pollutants.?-!2 To assess mucociliary 
clearance, we monitored the saccharin transport time 
(STT) as an overall parameter for the effectiveness 
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of the mucociliary system, and the ciliary beat fre- 
quency (CBF) in order to investigate possible cili- 
ary dyskinesia or akinesia. Mucociliary transport has 
been shown to be a sensitive parameter to show ad- 
verse or toxic effects in respiratory epithelia.!3-15 


This investigation was conducted to elucidate the 
extent of inflammatory reactions of human upper 
respiratory tract mucosa in vivo after a 4-hour metha- 
nol exposure at 200 ppm (German threshold limit) 
by monitoring the release of the proinflammatory me- 
diators IL-8, IL-1B, IL-6, and PGE2, as well as STT 
and CBF, and by using a questionnaire to assess sub- 
jective symptoms. 


PATIENTS AND METHODS 


Design. In this study, 12 healthy individuals (av- 
erage age, 26.8 years; SD, 2.1 years) were exposed 
to both 200 ppm and 20 ppm of methanol over 4 hours 
in an exposure chamber. We used 20 ppm instead of 
0 ppm as a control to blind the study because of neu- 
rophysiological investigations* that revealed that 20 
ppm of methanol can be recognized by individuals 
with normal olfaction. The interval between the two 
measurements was 1 week. This interval was chosen 
because the effects were expected to be mild enough 
to have been reversed completely by that time. The 
interval had proven to be effective in a similar study 
with trichloroethane.> All subjects were male, non- 
smoking students who were not taking any systemic 
or topical medication. Besides medical history-tak- 
ing and physical examination, all individuals under- 
went electrocardiography, spirometry, blood count, 
determination of y-glutamy] transpeptidase level, and 
determination of erythrocyte sedimentation rate. The 
subjects had no history of chronic sinusitis. Further- 
more, nasal endoscopy, ultrasound examination of the 
paranasal sinuses, anterior rhinomanometry, acous- 
tic rhinometry, STT, and olfactory function testing 
were performed before inclusion in the study. A brief 
medical check was repeated before each exposure, 
and contact lenses were not permitted in the exposure 
chamber. The conditions were standardized, with all 
individuals being at physical rest and temperature and 
humidity being regulated in the chamber. Twenty 
minutes after the 4 hours of exposure, the STT was 
measured and specimens of nasal secretion as well 
as of ciliated epithelial cells were harvested. A previ- 
ous investigation showed that measurable effects 
could be found in the interleukins chosen for this study 
after 4 hours of exposure. The protocol was designed 
in accordance with the Declaration of Helsinki (Hong 
Kong version of 1989), and before we obtained the 
tissue specimens, the protocol and informed consent 
form were approved by the institutional review board 
for human studies at our institution. 
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Exposure. The exposures were carried out in a stat- 
ic 18-m? exposure chamber. The concentrations of 
methanol were 203.5 ppm (SD, 2.5 ppm) and 20.3 
ppm (SD, 3.8 ppm), monitored with an infrared ana- 
lyzer (Miran 980, Foxboro Analytical, South Nor- 
walk, Connecticut). The temperature and humidity 
were kept at 20.4°C (SD, 0.7°C) and 48.1% (SD, 
2.6%), respectively. The airflow was less than 0.1 
m/s, and there was a complete change of air 6 times 
per hour. 


Assessment of Interleukins, PGE2, STT, and CBF. 
Nasal secretions were collected with absorbent foam 
rubber samplers in a technique well established for 
the determination of inflammatory mediators in na- 
sal secretions./ Quantitative assessment of the inter- 
leukins (IL-8, IL-1B, and IL-6) was performed with 
commercially available enzyme-linked immunosor- 
bent assays (HS immunoassays, R&D Systems, Min- 
neapolis, Minnesota) according to the manufacturer’s 
instructions. The release of PGE2 was analyzed with 
a competitive enzyme immunoassay as described by 
Schaefer et al.!7 The CBF was calculated by video 
interference contrast microscopy at 10 different sites 
of each specimen as described by Hafner et al.!8 The 
STT was determined by placing a small particle of 
saccharin on the mucosa of the head of the inferior 
turbinate and measuring the time that elapsed be- 
fore the individual reported detecting a strong sweet 
taste. 


Subjective Symptoms. A questionnaire of the Swed- 
ish Performance Evaluation System (SPES!%), in the 
German translation of Seeber et al,20 was completed 
by all individuals before and after 2 hours and 4 hours 
of exposure. The questionnaire contained 17 items 
related to local irritation of the nasal mucosa, diffi- 
culty in breathing, and prenarcotic symptoms. The 
symptoms were to be marked on an ordinal scale from 
0 (no symptoms) to 5 (severe symptoms). All indi- 
viduals had been familiarized with the questionnaire 
before the beginning of the experiment. Four sub- 
jects received Liquifilm eye drops (Pharm Allergan, 
Ettlingen, Germany) to prevent frequent blinking 
while their electroencephalogram (EEG) was re- 
corded with their eyes open over a period of 11 min- 
utes; thus, any symptoms related to irritating effects 
on the eyes of these particular subjects were not eval- 
uated. 


Statistical Evaluation. Statistical evaluation was 
performed with the SAS software package (version 
6.12). The concentrations of interleukins and PGE2 
following exposure to 200 ppm and 20 ppm of metha- 
nol were compared by the 1-tailed Wilcoxon test for 
matched pairs. This 1-tailed test was chosen because 
the alternative hypothesis was that the concentration 
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Median concentrations of A) interleukin-8 (n = 12) and B) interleukin-18 (n = 12) were significantly elevated after exposure 


to 200 ppm of methanol versus 20 ppm. 


of these parameters would increase. This test was also 
applied to evaluate the results of the questionnaire; 
for each experimental session, differences in scores 
“after 2 hours” minus “before” and “after 4 hours” 
minus “before” were calculated. Effects on the STT 
and CBF were tested with the 2-tailed Wilcoxon test 
for matched pairs. For each test, p < .05 was consid- 
ered significant. l 
RESULTS 


The median concentrations of IL-8 and IL-1 (see 
Figure) were significantly elevated after exposure to 
200 ppm of methanol versus 20 ppm, but no signifi- 
cant changes were seen in the concentrations of IL-6 
and PGE2 (Table 1). Neither the STT nor the CBF 
changed significantly after exposure to 200 ppm of 
methanol versus 20 ppm (Table 2). The scores on 
the subjective symptom questionnaire relating to ir- 
ritations before and after exposure to both 20 ppm 
and 200 ppm of methanol were not different (Table 
319), 

DISCUSSION 

This study revealed that the proinflammatory me- 
diators IL-8 and IL-1 are sensitive markers that can 
be detected in nasal secretions as a sign of a subclini- 
cal proinflammatory reaction after in vivo exposure 
to methanol at a concentration of 200 ppm. In an 
earlier investigation, we also found significant in- 
creases of IL-6, IL-8, and IL-1 after exposure to 
200 ppm of 1,1,1-trichloroethane.> Under the condi- 
tions used in this present study, the release of IL-6 


did not change significantly. However, the limited 
statistical power of this investigation, with only 12 
individuals being evaluated, needs to be kept in mind. 
The observation of a significant increase of IL-8 and 
the mild inflammatory responses in this pilot study 
might seem somewhat phenomenological in this 
small group of patients; however, these findings have 
to be further evaluated in future investigations using 
longer exposures such as 8 hours and, possibly, a 
combination with in vitro experiments using much 
higher concentrations. It is known that cytokines de- 
tected in nasal secretions may be synthesized by a 
number of different cells, and there is increasing evi- 
dence that nasal epithelial cells play an important 
role in the generation of proinflammatory mediators 
such as interleukins.?!-26 It has been suggested that 
pollution-induced inflammation of the upper and low- 
er airways may be due to increased synthesis and/or 
release of epithelial cell-derived mediators that in- 
fluence the activity of inflammatory cells* such as 
macrophages, neutrophils, eosinophils, mast cells, 
and lymphocytes. 


A 1-hour exposure to 1.39 ppm of methyl tertiary- 
butyl ether (MTBE) did not cause any nasal irrita- 
tion clinically, nor did the concentration of IL-8 or 
the number of neutrophils increase significantly in 
nasal lavage fluid of these individuals as compared 
to controls.2”7 This was a considerably low concentra- 
tion of MTBE, which might simply have been too 
low to cause measurable effects after only 1 hour of 


TABLE 1. CONCENTRATIONS OF INTERLEUKINS AND PROSTAGLANDIN E2 IN NASAL SECRETIONS 























20 ppm Methanol 200 ppm Methanol 
n Ist Quartile Median 3rd Quartile Ist Quartile Median 3rd Quartile p 
IL-18 (pg/mL) 11 4.1 8.3 13.9 9.6 21.4 43.5 O01 
IL-8 (pg/mL) 12 225 356 482 254 424 550 .02 
IL-6 (pg/mL) 12 4.4 6.5 9.5 3.9 10.3 13.8 -13+ 
PGE2 (ng/mL) 12 12.7 13.4 13.6 12.6 13.6 13.8 49* 


*Not significant. 
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TABLE 2. EFFECTS OF METHANOL ON SACCHARIN TRANSPORT TIME AND CILIARY BEAT FREQUENCY 























20 ppm Methanol 200 ppm Methanol 
n Ist Quartile Median 3rd Quartile Ist Quartile Median 3rd Quartile p 
Saccharin transport time (s) 12 384 526 7i 435 527 728 52* 
Ciliary beat frequency (Hz) 12 TA 78 9.3 6.95 8.2 9.2 89% 


*Not significant. 





exposure. However, MTBE is known to cause irrita- 
tions in nasal mucosa at high concentrations. Diaz- 
Sanchez et al28 showed that nasal challenge to diesel 
exhaust particles leads to measurable expression of 
messenger RNA of various interleukins in cells gath- 
ered from nasal lavage fluid. 


In our present study, we did not find any signifi- 
cant changes in the concentration of PGE2 after 4 
hours of exposure to 200 ppm of methanol. This find- 
ing indicates that PGE2 is not sensitive enough as a 
marker to detect subclinical inflammatory irritations 
after methanol exposure in vivo. In an investigation 
by Koren et al,® the concentration of PGE2 increased 
after acute exposure to 0.4 ppm of ozone over 2 hours 
in bronchial lavage fluid, but not in nasal lavage fluid. 
Ongoing studies, especially in vitro exposures to 
higher concentrations, will further elucidate the role 
of PGE2 in the response mechanisms of respiratory 
epithelial cells after exposure to air pollutants. 


We did not see any impairment of CBF after acute 


TABLE 3. RESULTS OF SUBJECTIVE SYMPTOM 
QUESTIONNAIRE 


Before Exposure After 4-h Exposure 














Item n 20ppm 200 ppm 20 ppm 200 ppm 
Headache 12 0 0 0 0 
Dizziness 12 0 0 0 0 
Nausea 12 0 0 0 0 
Tiredness 12 0 0 1 1 
Pain or pressure 

over chest 12 0 0 0 0 
Coughing spells 12 0 0 0 0 
Shortness of breath 12 0 0 0 0 
Irritation to eyes 9 0 0 0 0 
Watering eyes 9 0 0 0 0 
Blurred sight 12 0 0 0 0 
Irritation to nose 12 0 0 0 0 
Running nose 12 0 0 0 0 
Sensation of bad 

smell 12 0 0 0 0 
Irritation to throat 12 0 0 0 0.5* 
Sensation of 

unpleasant taste 12 0 0 0 0 
Irritation to skin 12 0 0 0 0 
Feeling of fainting 

or vertigo 12 0 0 0 0 


Data are median scores obtained on Swedish Performance Evalua- 
tion System.!9 0 — no symptoms; 5 — severe symptoms. 
*Not significant. 











exposure to 200 ppm of methanol, in accordance with 
the results of Andersen et al,22 who wrote that muco- 
ciliary clearance was not impaired after 6 hours of 
exposure to 100 ppm of toluene in vivo. The expo- 
sure time was limited to only a few hours in these 
two studies, so they do not exclude measurable ef- 
fects that may occur after chronic long-term expo- 
sures, as in occupationally exposed workers. A 24- 
hour exposure to 400 ppm of isopropanol impaired 
ciliary activity in pigs.!* However, it is important to 
realize that reports on effects on CBF in different 
animal models are controversial, and that significant 
differences in mucociliary clearance between differ- 
ent species have been shown.3933 Nihlén et al34 ex- 
posed healthy individuals to 5, 25, and 50 ppm of 
MTBE over 2 hours. Albumin, eosinophilic cationic 
protein, myeloperoxidase, lysozyme, and total cell 
count were measured in nasal lavage fluid before, 
during, and after exposure to MTBE. However, no 
clear dose-effect caused by MTBE was shown with 
this experimental design. The same group conducted 
a similar experiment with 0, 5, 25, and 50 ppm of 
ethyl tertiary-butyl ether (ETBE), analyzing the same 
parameters and following the same protocol.35 Again, 
the results did not reveal any significant effects. In 
that study, IL-8 was measured additionally after ex- 
posure to 50 ppm; however, no effect on the IL-8 con- 
centration that was believed to be caused by ETBE 
was seen. 


In conclusion, in our present study, acute expo- 
sure to 200 ppm of methanol in vivo caused a sub- 
clinical inflammatory response, indicated by in- 
creased concentrations of proinflammatory cytokines 
(IL-8 and IL-1) in nasal secretions. Subjective clini- 
cal irritations were not detected under the conditions 
used. Whether this response of the nasal respiratory 
mucosa plays an important role in the induction of 
acute and chronic nasal diseases will have to be evalu- 
ated in ongoing studies. It will be important to ana- 
lyze the effects of subacute exposures and to use a 
suitable in vitro model to evaluate higher concentra- 
tions of toxic air pollutants. It is also important to 
keep in mind that after having looked at the effects 
of different singular substances, we should continue 
to elucidate the effects of combinations of these sub- 
stances in an effort to enhance our understanding of 
the environmental effects on human nasal respira- 
tory mucosa. 
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Speech recognition scores were analyzed in 34 carriers of a DFNAS mutation. Cross-sectional linear regression analysis (last 
visit, maximum recognition score in %Correct on age or PTA1,2,4 kHz) established onset age (score 90%) at 16 years and onset 
PTA1,2,4 kHz level (score 90%) at 41 dB hearing level. The deterioration rate was 0.7%/y in the plot of maximum score against age, 
whereas the deterioration gradient was 0.4%/dB in the plot of maximum score against PTA1,2,4 kHz. Given the previously demon- 
strated rapid progression of hearing impairment, speech recognition was relatively good: at age 70, the score was still >50%. 


KEY WORDS — DFNAS, genetic hearing loss, speech recognition. 


INTRODUCTION 


Sensorineural hearing impairment is one of the 
most frequent handicaps in the world. Since the late 
1800s, hereditary factors have been recognized among 
its causes. Our knowledge about inherited sensori- 
neural hearing impairment has expanded tremen- 
dously, owing to the development of genetic linkage 
techniques. These techniques enable us to localize a 
region within the DNA in which the affected gene is 
situated. According to the mode of inheritance, a ło- 
cus for nonsyndromic inherited hearing impairment 
is called DFNA (autosomal dominant), DFNB (auto- 
somal recessive), or DFN (X-linked). The loci are 
numbered in chronological order of discovery. Once 
a locus has been mapped, the search for the causa- 
tive gene can begin. Currently, almost 80 loci for non- 
syndromic hearing loss have been found, and 21 genes 
have been identified.! 


In 1965, Huizing et al? described a very large Dutch 
family with autosomal dominant sensorineural hear- 
ing impairment. At the last follow-up study (1994 to 
1997), the family comprised 121 affected members 
(unpublished data). Several phenotype studies on the 
family have been published, providing a detailed 
analysis of the segregating trait.28 The type of hear- 
ing impairment was characterized as postlingual and 
progressive; the high frequencies were most severely 





affected and showed the highest, especially initial, 
progression. Linkage and genotype analyses were per- 
formed in the past few years.?"!! The trait was mapped 
to chromosome 7p15 and given the designation 
DFNAS.? After refinement of the region, the disease- 
causing gene was isolated and named DFNAS, be- 
cause the function of the gene (which would suggest 
anew name) is as yet unknown.!0.!! Cochlear expres- 
sion of a DFNAS homolog was found in the mouse.!! 


To further delineate the phenotypic features, we 
analyzed the speech recognition scores of a number 
of DFNAS mutation carriers. 


PATIENTS AND METHODS 


Audiometry was performed with a portable or a 
clinical audiometer according to common clinical 
standards.%8 For speech recognition, standard pho- 
netically balanced Dutch word lists were presented 
to either ear. Speech recognition scores (right ear on- 
ly) were retrieved from 34 DFNAS5 mutation carriers 
7 to 67 years of age. Performance-intensity curves 
relating to the phoneme recognition score were ana- 
lyzed (last visit). We included the speech reception 
threshold (decibels sound pressure level at 50% pho- 
neme recognition score) and the maximum (unaided) 
phoneme recognition score (%Correct) at which the 
pure tone average at 1, 2, and 4 kHz (PTA1,2,4 kHz) 
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Fig 1. Cross-sectional speech recognition scores at last visit against A) age and B) pure tone average at 1, 2, and 4 kHz 


(PTA1,2,4 kHz). Asterisks indicate outlying values. 


could be measured. 


Plots of speech reception threshold versus PTA1,2,4 
kHz were used with linear regression analysis to check 
for the reliability of the phoneme recognition scores. 
Chauvenet’s criterion!” and the residual standard de- 
viation were used to identify and exclude outlying 
values. Linear regression analysis of the maximum 
phoneme recognition score on age (performance-age 
plot) or PTA1,2,4 kHz (performance-impairment plot) 
was performed with the Prism PC version 3.02 pro- 
gram (GraphPad, San Diego, California). X90, ie, the 
value of X where Y = 90% (called onset age for the 
performance-age plot and onset level for the perfor- 
mance-impairment plot), and its 95% confidence in- 
terval were derived by a nonlinear regression method 
to fit the equivalent linear equation Y = Slope(X — 
X90) + 90. Slope was called deterioration rate in the 
performance-age plot, whereas it was called deteri- 
oration, gradient in the performance-impairment plot. 
Longitudinal recognition scores (n = 2-12) were 
available and analyzed in 10 cases. 
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RESULTS 


The speech reception threshold (data not shown) 
correlated significantly with the PTA1,2,4 kHz level 
and was generally in agreement with the pure tone 
audiogram. The maximum phoneme recognition 
scores are plotted against age in Fig 1A. The speech 
recognition scores deteriorated significantly (slope 
> 0) with increasing age. The onset age was 16 years 
(95% confidence interval, 10 to 22 years), and the 
deterioration rate was 0.7%/y (95% confidence in- 
terval, 0.4%/y to 0.9%/y). Previously, on the basis 
of the analysis of pure tone audiograms, van den Wijn- 
gaart et al® found the onset age to be between 5 to 15 
years. Figure 1B shows an onset level of 41 dB (95% 
confidence interval, 30 to 52 dB) and a deterioration 
gradient of about 0.4 %/dB (95% confidence inter- 
val, 0.2%/dB to 0.6%/dB). Figure 1A and Fig 1B 
each include 1 outlying value (asterisk) representing 
a patient who showed exceptionally rapid progres- 
sion for the trait. He was followed for many years 
(Fig 2). Interestingly, this patient received a cochlear 


100-p=c0 
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Fig 2. Individual longitudinal speech recognition scores against A) age and B) PTA1,2,4 kHz are depicted by circles and con- 
tinuous lines. Asterisks pertain to same patient as in Fig 1. Dotted line indicates 5% lower normal limit for cochlear losses 


established at our laboratory. 
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implant and has performed very well since. This 
patient’s score, as well as the scores of the other pa- 
tients, were well above the 5% lower confidence limit 
for cochlear lesions observed at our laboratory (Fig 
2). 


Longitudinal data are depicted in Fig 2. The pa- 
tient selected for cochlear implantation at age 26 
years (n = 12) is easily recognized by a large drop in 
speech recognition scores (Fig 2A, asterisk). How- 
ever, despite this patient’s exceedingly poor (unaided) 
scores, the usual relationship between score and 
PTA1,2,4 kHz level was maintained. Some, but not all, 
of his scores were lower than our 5% lower normal 
limit (dotted line). 

DISCUSSION 


The DFNAS patients showed onset of deteriora- 
tion of speech recognition in their second decade (on- 
set age, 16 years). This onset age is much younger 
than that previously found in patients with other pro- 
gressive, autosomal dominant traits that predomi- 
nantly affect the high frequencies, such as DFNA2/ 
KCN@Q4 (34 years) and DFNA9/COCH (43 years).!5 
The deterioration rate in our DFNAS patients (0.7%/ 
y) was higher than that in our DFNA2 patients (0.3%/ 
y), but much lower than that in our DFNAQ patients 
(1.8%/y). It was checked that the threshold levels 
typical of our DFNAS patients were fairly similar to 
those in our DFNA2 and DFNA9 patients (data not 
shown). 


The relatively slow deterioration in speech recogni- 
tion is remarkable in light of the rapid progression in 
hearing loss documented for our DFNAS patients.?°8 


Their extrapolated maximum score was still >50% 
at about 70 years of age (Fig 1A). A puzzling excep- 
tion was the young man whose excessive deteriora- 
tion prompted cochlear implantation at the age of 26 
years. The fact that he performed very well after im- 
plantation (data not shown) supports our assumption, 
based on a murine cochlear expression study,!! that 
the DFNAS mutation causes purely cochlear dysfunc- 
tion. 


The onset PTA1,2,4 kHz level in our DFNAS pa- 
tients was about 40 dB hearing level; this was fairly 
similar to the onset level in DFNA9 patients (about 
45 dB hearing level), but much lower than that in 
DFNA2 patients (65 dB).!3 On the other hand, the 
deterioration gradient in our DFNAS patients (0.4%/ 
dB) was fairly similar to that in our DFNA2 patients 
(0.5%/dB), whereas it was lower than that in our 
DFNA9 patients (1.2%/dB).13 At similar PTA1,2,4 kHz 
levels (>60 dB), the maximum phoneme scores of 
our DFNAS patients were between those of our 
DFNA2 and DFNA9 patients (data not shown). 


We previously suggested that DFNA2/KCNQ4-re- 
lated high-frequency sensorineural hearing impair- 
ment can be attributed to a lack of expression of K* 
channels in mutation carriers, especially in inner hair 
cells of the lower cochlear turns.!3 Relative sparing 
of outer hair cell function in these patients could ac- 
count for their relatively good speech recognition 
scores, We speculate that DFNAS patients retain rela- 
tively good speech recognition scores on a similar 
basis. Hopefully, more detailed expression studies 
will clarify this issue in the near future. 
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LEMON-FLAVORED COD LIVER OIL AND A MULTIVITAMIN- 
MINERAL SUPPLEMENT FOR THE SECONDARY PREVENTION OF 
OTITIS MEDIA IN YOUNG CHILDREN: PILOT RESEARCH 


LINDA A. LINDAY, MD JAY N. DOLITSKY, MD 
NEw York, NEW YORK , NEW YORK, NEW YORK 
RICHARD D. SHINDLEDECKER, MA C. E. PIPPENGER, PHD 
NEW ROCHELLE, NEW YORK ALLENDALE, MICHIGAN 


We measured blood levels of fatty acids, vitamin A, and trace metals in children undergoing ambulatory surgery for placement 
of tympanostomy tubes and a comparison group having other ambulatory surgical procedures. We then performed a small, outpa- 
tient, secondary prevention study using nutritional supplements chosen on the basis of those blood levels. The study subjects had 
lower levels of red blood cell eicosapentaenoic acid (EPA) than did adult controls. Consistent with previous reports, the levels of 
vitamin A were <40 g/dL for 69% of our subjects, and the plasma selenium levels for children were lower than published values for 
adults. We then studied one otitis media (OM) season; 8 children (0.8 to 4.4 years of age) received 1 teaspoon of lemon-flavored cod 
liver oil (containing both EPA and vitamin A) and 1 half-tablet of a selenium-containing children’s chewable multivitamin-mineral 
tablet per day. During this OM season, study subjects received antibiotics for OM for 12.3% + 13.4% (SD; p < .05) fewer days during 
supplementation than before supplementation. Larger, controlled trials are warranted to assess the utility of cod liver oil (of accept- 
able purity and taste) and a children’s multivitamin-mineral preparation containing selenium, both for the prevention of OM and for 
the acceptance of delayed prescription of antibiotics for this disorder. 


KEY WORDS — eicosapentaenoic acid, fatty acid, selenium, vitamin A. 


INTRODUCTION l is still a need for innovative strategies for the pre- 


Otitis media (OM) is a common! and costly? prob- vention and treatment of OM. 
lem in infants and young children. Frequent OM with 


ERA : : : The association between respiratory virus infec- 
effusion in early childhood may be associated with tons and AOM in children is well established.>? Al- 


ne pati ee AE aaa SA na though viral vaccines are in development, the use of 
AIVE TO ANONSSIPS AYS MORASS CERTIGO Y OSA vaccines to prevent upper respiratory tract infections 


34 : arte i ; l C 
KRE f e i a AEri oe he ee is hampered by the large number of different viruses 
patois Be VO TAE en en, causing these infections. It is therefore noteworthy 


. pate Tosti bacteria.” Streptococcus p sa cack that oral supplementation with essential fatty acids!° 
niae causes 40% to ° OnT cases Of acute harterial and zinc!! can decrease the incidence of respiratory 
Ontis media (AM): Hona 25% to 50% of CASES infections in children. Furthermore, vitamins (espe- 
of S pneumoniae in the United States are now resis- cially vitamin A22), trace metals, and omega-3 fatty 


icillin.56 6 ; : : : 
tant to Penicillin aliens Black et aP aeponed acids have important effects on immune function. 13-14 
that the heptavalent pneumococcal polysaccharide 


conjugate vaccine decreased tympanostomy tube (TT) Free radicals are molecules, or molecular frag- 
placement by 20%, Eskola et al’ reported only a 6% ments, with 1 or more unpaired electrons in their out- 
reduction in the number of episodes of AOM from er orbit; they are highly reactive chemical entities 
any cause with this vaccine. Uhari et al8 demonstrated that can initiate oxidative stress and damage lipid 
that xylitol administered 5 times per day in chewing membranes by a process known as lipid peroxidation, 
gum or syrup can prevent AOM in children attend- and are also important in the inflammatory process. !5 
ing day care. However, these investigators noted that Free radical—induced lipid peroxidation may play a 
a more practical dosing schedule was needed before role in the acute injury!6 and chronic inflammation!7 
xylitol could be widely recommended. Thus, there of a guinea pig model of acute, unilateral OM due to 
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infection with S pneumoniae. Inflammatory media- 
tors have been demonstrated in both experimental and 
human middle ear effusions,!® and the inflammatory 
response is important in the pathogenesis of both 
pneumococcal infections!’ arid influenza A virus in- 
fections.2° Antibiotics may have anti-inflammatory 
-actions in addition to their antibacterial effects.?! 


We hypothesized that children undergoing inser- 
tion of TTs because of frequent OM and/or effusion 
would have deficiencies in 1 or more nutritional pa- 
rameters that influence the immune, inflammatory, 
and antioxidant systems, including essential fatty 
acids, vitamin A, and trace metals. We also hypothe- 
sized that oral supplementation with the deficient nu- 
trients would decrease the frequency of OM and/or 
the need for treatment with antibiotics in some chil- 
dren. We therefore performed the following study in 
2 parts. In part 1, we measured blood levels of es- 
sential fatty acids, vitamin A, and trace metals in chil- 
dren undergoing ambulatory surgery for placement 
of TTs, as well as in a comparison group of children 
having other ambulatory surgical procedures. In part 
2, we performed a small, outpatient, secondary pre- 
vention study using nutritional supplements chosen 
on the basis of the results of part 1. 


PART 1: BLOOD LEVELS OF FATTY ACIDS, 
VITAMIN A, AND TRACE METALS 


METHODS 


Subjects. Our subjects were children between 1 
and 10 years of age admitted to the Ambulatory Sur- 
gery Unit of The New York Eye and Ear Infirmary 
(NYEE) for clinically indicated surgery. The TT 
group consisted of children undergoing placement 
of TTs for frequent ear infections and/or persistent 
‘ middle ear effusion, with or without concomitant ade- 
noidectomy and/or tonsillectomy. The comparison 
group (COMP) was composed of children undergo- 
ing eye muscle surgery, as well as those undergoing 
ear, nose, and throat procedures such as bronchos- 
copy or laryngoscopy that did not involve the ears, 
adenoids, or tonsils. General study exclusion crite- 
ria included a history of frequent streptococcal throat 
infections; chronic medical conditions (eg, epilepsy, 
heart or kidney disease); a medical syndrome (eg, 
Down syndrome); and treatment with chronic medi- 
cations other than antibiotics. Exclusion criteria for 
comparison subjects included a history of frequent 
ear infections, TTs, or an ear infection in the 3 months 
prior to entry into the study. This research was ap- 
proved by the NYEE Institutional Review Board 
(IRB). Before surgery, written informed consent was 
obtained from at least 1 parent; written assent was 
obtained from children over 6 years of age. Each sub- 
ject’s ambulatory surgery chart was reviewed, and a 


study history was obtained. The families were ques- 
tioned regarding their child’s ingestion of vitamins 
and other nutritional supplements; they were also 
questioned regarding symptoms of night blindness 
in their child. Subjects were classified as Hispanic 
on the basis of language and/or surname. This study 
was bilingual; written consent forms were available 
in Spanish; consents and histories were obtained by 
a Spanish-speaking investigator (L.A.L.). 


Laboratory Analyses. The subjects had undergone 
an overnight fast, per standard preoperative proto- 
col. Blood samples were obtained after the induction 
of anesthesia, at the time an intravenous line was 
placed, and before surgery was performed; anesthe- 
sia was induced by mask with a combination of oxy- 
gen, nitrous oxide, and sevofluorane. For fatty acid 
analyses, 7 mL of whole blood was drawn into a col- 
lection tube containing ethylenediaminetetraacetic 
acid (EDTA) as the anticoagulant. Per directions from 
the initial fatty acid laboratory, specimens from the 


_first 23 subjects were kept cold from the time they 


were drawn; red blood cells (RBCs) were separated 
by centrifugation and washed twice with an equal 
volume of 0.9% saline solution; and washed RBCs 
were then frozen at —20°C” in glass vials sealed with 
Parafilm. Upon changing fatty acid laboratories, these 
specimens (which were frozen at —20°C for 10 days 
to 21/2 months) were transferred to a —80°C freezer 
and subsequently shipped overnight on dry ice to the 
Peroxisomal Diseases Laboratory of the Kennedy 
Krieger Institute (Baltimore, Maryland), where they 
were stored at —80°C until analyzed by previously 
published techniques.?3 Specimens from the next 16 
subjects were processed per the standard protocol of 
the Peroxisomal Diseases Laboratory; blood was 
drawn into an EDTA collection tube and shipped over- 
night at room temperature to the Laboratory, where 
RBCs were frozen under nitrogen at -80°C for a max- 
imum of 12 months until extraction.”3 


For vitamin A (retinol) and trace metal levels, 7 
mL of whole blood was drawn into a trace element 
collection tube containing EDTA as the anticoagu- 
lant and wrapped in aluminum foil to protect it from 
light. The specimens were kept cold (<10°C) from 
the time they were drawn and were shipped over- 
night in insulated containers with frozen cold packs 
to MedTox Laboratories (St Paul, Minnesota). There, 
RBCs were separated from plasma, washed 3 times 
with physiologic saline solution, and lysed with de- 
mineralized water; the specimens were then frozen 
at—70°C until analyzed. The specimens were handled 
in this manner to allow the determination of RBC 
free radical-scavenging enzymes,?425 which will be 
reported separately. Plasma vitamin A (retinol) and 
trace metal analyses were performed by standard 
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TABLE 1. RED BLOOD CELL LEVELS OF FATTY ACIDS 


























Tympanostomy Tube Group Comparison Group Adult Control Group* 

(NYEE Children; n = 16) (NYEE Children; n=5) (Kennedy Krieger; n = 6) 
n-6 18:2n-6 (linoleic) 9.35 + 0.28 9.46 + 0.50 8.55 + 0.45 
20:3n-6 (dihomo-g-linolenic) 1.57 + 0.08 1.69 + 0.15 1.70 + 0.14 
20:4n-6 (arachidonic) 15.94 + 0.36 15.44 + 0.64 16.36 + 0.59 
n-3 18:3n-3 (alpha linolenic) 0.29 + 0.06 0.23 + 0.10 0.29 + 0,09 
20:5n-3 (eicosapentaenoic) 0.31 + 0.027 0.31 + 0.04} 0.48 + 0.04 
22:6n-3 (docosahexaenoic) 3.78 + 0.33 3.65 + 0.60 3.97 + 0.54 
Totals Total n-6 33.49 + 0.40 33.38 + 0.72 32.31 + 0.66 
Total n-3 6.35 + 0.36 6.10 + 0.65 6.79 + 0.60 
Total n-9 13.56 + 0.26 13.44 + 0.47 13.76 + 0.43 


Data are mean percentage of total fatty acids + SE. 


*Adult Control data supplied by Ann Moser of Peroxisomal Diseases Section of Kennedy Krieger Institute Genetics Laboratory (Baltimore, 


Maryland). 


{Values for New York Eye and Ear Infirmary (NYEE) groups are less than those for adult control group; p = .007, Kruskal-Wallis analysis of 


variance by ranks. 





techniques.?27 Plasma vitamin A levels were deter- 
mined by high-performance liquid chromatography 
with ultraviolet detection (HPLC-UV); the reporting 
limit was 0.025 g/mL. Trace metals were assayed 
by inductively coupled plasma-mass spectrometry; 
the detection limits were 25 ng/mL for selenium, 20 
ug/dL for zinc, and 0.5 g/mL for copper. 


Data Analysis. Levels of RBC fatty acids were ana- 
lyzed by a 1-way analysis of variance (ANOVA) with 
a significance level of .05.28 Levels of eicosapenta- 
enoic acid (EPA) were also analyzed with the non- 
parametric Kruskal-Wallis ANOVA by ranks.?8 Plas- 
ma selenium, zinc, and copper levels for the NYEE 
TT group were compared to data from the published 
literature.2+.25 The variances for selenium, zinc, and 
copper were tested across groups; only the variances 
for selenium were homogeneous (Bartlett’s test for 
homogeneity of variances??). Therefore, only seleni- 
um was tested with a 1-way ANOVA, and differences 
between individual groups were assessed by the New- 
man-Keuls test.?? 


RESULTS 


Demographics. Specimens were obtained from 39 
children in the NYEE TT group; vitamin A and trace 
metal determinations were performed for all of these 
subjects; RBC fatty acid levels are available for 16 of 
these subjects (see Red Blood Cell Fatty Acids, be- 
low). Only RBC fatty acid levels are available for the 
5 children in the NYEE COMP group. The subjects 
in the TT group had a mean (SD) age of 3.8 + 1.8 
years; 72% were male; approximately half were His- 
panic and half were white; approximately half were 
private patients; and almost half were taking vitamin 
supplements. The COMP subjects were slightly older, 
with a mean age of 5.7 + 1.8 years, and a lower per- 
centage (20%) of private patients. No demographic 
information is available for the 6 adults in the adult 





control (AC) group; these data were kindly provided 
by Ann Moser of the Peroxisomal Diseases Section 
of the Kennedy Krieger Institute Genetics Labora- 
tory (Baltimore, Maryland). 


Red Blood Cell Fatty Acids. Oxidation of RBC 
fatty acids was reported in most of the specimens 
from the first 23 children, and was considered to be 
due to improper storage, particularly, freezing the 
specimens at —20°C (Ann Moser, personal commu- 
nication, 2000). This finding was unexpected, as Bell 
et al2? did not find loss of RBC fatty acids in control 
(nonschizophrenic) samples stored above —70°C. We 
therefore report only the RBC essential fatty acid re- 
sults for the subsequent 16 subjects, whose speci- 
mens were mailed directly to Kennedy Krieger (see 
Laboratory Analyses, above) and had no evidence 
of oxidation. 


The RBC fatty acid data are summarized in Table 
1. The mean values for EPA were lower in both groups 
of NYEE children than in the AC data supplied by 
Kennedy Krieger. The mean (ESE) RBC EPA values 
were 0.31% + 0.02% in the TT group (n= 16), 0.31% 
+ 0.04% in the COMP group (n = 5), and 0.48% + 
0.4% in the AC group (n= 6). These differences were 
statistically significant when analyzed by both para- 
metric (ANOVA, p < .002; F2,24 = 8.336) and non- 
parametric (Kruskal-Wallis ANOVA by ranks, p = 
.007; H2,27 = 9.924) tests. No other significant dif- 
ferences in RBC fatty acids were noted among the 3 
groups, although additional differences might be- 
come apparent with larger sample sizes. 


Vitamin A. The mean (+SD) plasma vitamin A (ret- 
inol) level for the TT group was 39.1 + 10.8 g/dL 
(range, 23.6 to 72.1 g/dL; n = 39). (Multiply conven- 
tional units by 0.0349 to convert to international 
units.°°) Ballew et al?! define an inadequate vitamin 
A level as less than 20 g/dL, and a suboptimal vita- 
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TABLE 2, PLASMA LEVELS OF SELENIUM, ZINC, AND COPPER 








Tympanostomy Tube Group 
(NYEE Children; n = 39) 





Children*4 (n = 83) 





Adult Long Island Jewish Adult West Coast 
Controls?’ (n = 12) Controls?’ (n = 14) 


Selenium ng/mL 110 + 16* 106 + 18* 129 +22 142 +18 
umol/L 1.40 £0.21 1.35 £0.23 1.64+0.27 1.81 4 0.23 

Zinc ug/mL 0.90.1 1.00.2 1.3+02 1.8+0.5 
umol/L 13.9+2.0 15.3+3.1 19.9+3.1 27.5 +7.6 

Copper pg/mL 1.340.2 1.2403 1.1402 1.3 +0.5 
umol/L 20.7 3.3 18.8 £4.7 17.3+3.1 20.4+7.8 


Data are mean + SD. 
*Children < adults; p < .005, analysis of variance. 





min A level as less than 30 ug/dL. The upper limit of 
the pediatric reference range for vitamin A is 80 ug/ 
dL.30 Thus, overall, the values for our subjects were 
within the reference range. However, Russell? states 
that a vitamin A level of 40 ug/dL is the level that 
predicts normal dark adaptation 95% of the time; 69% 
of our subjects (27/39) had vitamin A levels of <40 
ug/dL, although the parents denied symptoms of night 
blindness for all children. Our finding is consistent 
with the report of Ballew et al stating that the 75th 
percentile for serum retinol levels in children 4 to 8 
years of age was 39.0 ug/dL. In addition, 15% of our 
sample (6/39) had suboptimal levels (<30 g/dL), al- 
though none had inadequate levels.3! In this subop- 
timal group, 5 of the 6 children were not taking vita- 
min supplements; 5 were Hispanic, 5 were general 
service patients, and 4 were female (all of whom were 
Hispanic). 

On a group basis, there was no statistically signifi- 
cant difference in the mean (+SD) vitamin A levels 
between the subgroup of children taking vitamin sup- 
plements (39.7 + 11.6 g/dL; n = 17) and the sub- 
group of children taking no vitamin supplements 
(38.7 + 10.4 ug/dL; n = 22). Our subjects took a va- 
riety of prescription and over-the-counter vitamin 
preparations; no children were taking or had taken 
cod liver oil; only 1 child in the fatty acid subgroup 
had a past history of fish oil ingestion. Of the 8 dif- 
ferent children’s vitamin preparations we examined, 
all contained vitamin A palmitate, vitamin A acetate, 
and/or beta-carotene. 


Trace Metals. The data for plasma selenium, zinc, 
and copper for the children in the TT group are sum- 
marized in Table 2; data from the published litera- 
ture for children”4 and adults? for these parameters 
are also included, as are International System Units 
(SI units). There was no statistically significant dif- 
ference between the mean (tSD) plasma selenium 
level for study subjects (110 + 16.3 ng/mL; n = 39) 
and the published values for children. Both groups 
of children had selenium levels that were lower than 
the published values for adults (p < .005, ANOVA; F 
= 20.442; F[0.005]3,140 = 4.47).29 


PART 2: CLINICAL SECONDARY PREVENTION 
STUDY 


METHODS 


The blood level data from part 1 of this study re- 
vealed that 1) the study children had lower levels of 
RBC EPA than did the ACs; 2) 69% of the study 
children had vitamin A levels in the lower reference 
range, with some levels in the suboptimal range; and 
3) the study subjects had levels of plasma selenium 
lower than the published values for adults.?25 We 
therefore selected cod liver oil, which contains EPA 
and vitamin A, and a children’s chewable multivita- 
min-mineral preparation containing selenium as the 
nutritional supplements to be used in the clinical por- 
tion of this research. We were unable to identify a 
liquid marketed formulation of a children’s multivi- 
tamin-mineral preparation containing selenium. We 
studied one “otitis media season”: the time period 
from September 1, 2000, through March 31, 2001. 


Study Design. We initially planned a randomized, 
primary prevention study in day-care centers.’ Be- 
cause matching placebos for study supplements were 
not available, we planned to randomize subjects to 
“supplement” and “no supplement” groups after en- 
try into the study. We recruited in a large urban day- 
care center affiliated with a major medical center. 
However, we found that only the parents of children 
with frequent ear infections were interested in the 
study; that these parents would not accept random- 
ization to a “no supplement” group; and that we could 
not enforce such randomization, because the study 
supplements could be purchased in a health food 
store. In addition, although attendance at day care is 
a major risk factor for ear infections, there were few 
children at least 1 year of age with a history of ear 
infections attending the center at which we were re- 
cruiting. We used a lower age limit of 1 year of age 
in this initial clinical study because of the contro- 
versy regarding the use of essential fatty acids in in- 
fants.°3.34 We therefore amended the protocol to an 
open-label, secondary prevention study, with recruit- 
ing from a general pediatrics practice. All protocols 
were approved by the NYEE IRB. Written informed 
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consent was obtained from at least 1 parent before a 
child’s entry into the study. 


Subjects. Eligible children were at least 1 year of 
age and had experienced at least 1 episode of OM 
from September 1 through November 30, 2000 (the 
early portion of the “OM season” under considera- 
tion). We received special permission from the NYEE 
IRB to enroll subject 6, who was only 9.5 months 
(0.8 years) of age on entry. The children were re- 
quired to be in good general health and taking no 
regular medications other than antibiotics; the fami- 
lies were required to be English-speaking. All sub- 
jects were patients of the Soho Pediatrics Group, a 
private pediatric group practice. For recruiting pur- 
poses, this group provided a computer-generated list 
of patients seen with a diagnosis of OM during the 
above time period. The principal investigator (PI; 
L.A.L.) contacted these families by telephone and 
mailed additional information to those who expressed 
an interest in the study. The families were informed 
that all study visits would be home visits. Approxi- 
mately 15% of eligible children enrolled in the study. 
The primary reason for nonenrollment was limited 
parental time due to both parents’ working outside 
the home, with frequent medical visits required for 
their child’s OM; some parents were averse to any 
research requiring their child to take a medication or 
supplement; others were concerned about the taste 
of cod liver oil. 


Study Procedures. The families provided a medi- 
cal history and written permission for the investiga- 
tors to obtain information from their health care pro- 
viders during the course of the study. On entry, they 
were specifically questioned about night blindness 
and use of nutritional supplements. The PI instructed 
the primary caregivers (all mothers) to crush the half- 
tablet of study vitamin-mineral supplement, measure 
1 teaspoon of study cod liver oil, and mix both with 
a small amount of food (such as applesauce, yogurt, 
or rice cereal) before administering the supplements 
to their child. The parents were informed that supple- 
ments were to be given only in the amounts required 
by the study, that the tablets must be crushed for small 
children, and that study supplements were to be kept 
out of reach of children; labels with these instruc- 
tions and warnings were affixed to the study supple- 
ments. The families were given a brief diary in which 
they were asked to record their child’s symptoms and 
the administration of study supplements. The study 
visits were performed at the subject’s home and/or 
mother’s workplace by the PI. The children under- 
went a physical examination at the beginning and end 
of the study; the PI also visited each family at least 
once during the study to monitor progress and sup- 
ply additional study supplements. In addition, the PI 


-contacted each family by telephone at least once a 


month to follow each subject’s status. At each study 
contact, the primary caregiver was questioned regard- 
ing the presence of specific symptoms (decreased ap- 
petite, headaches, vomiting, lethargy, peeling skin, 
and bone pain) that could be related to vitamin A 
toxicity. 


The PI reviewed each subject’s health care records. 
For the time period from September 1, 2000, through 
March 31, 2001, the following information was tabu- 
lated: number of pediatric visits for ear infections 
and any other diagnoses; number of days and name(s) 
of antibiotics prescribed for ear infections; and num- 
ber of days and name(s) of antibiotics prescribed for 
any other infections; number and purpose of visits 
to a pediatric otolaryngologist with dates and types 
of surgery (if any). Maternal diaries (available for 
subjects 1 to 7) were reviewed for administration of 
study supplements and antibiotics given for OM and 
any other diagnoses. 


Study Nutritional Supplements. The children re- 
ceived 1 teaspoon of Carlson’s lemon-flavored cod 
liver oil per day (selected for its acceptable taste), 
and 1 half-tablet of Carlson’s Scooter Rabbit chew- 
able vitamins and minerals (chosen because it con- 
tains selenium and has an acceptable taste); the sup- 
plements were administered as described above 
(Study Procedures). The total daily dose of omega-3 
fatty acids was 45 to 50 mg of alpha linolenic acid 
(ALA); 460 to 500 mg of EPA; and 500 to 550 mg of 
docosahexaenoic acid (DHA). Recommended Daily 
Allowances (RDAs) for these fatty acids have not 
been established. The total daily dose of some vita- 
mins used in this study was above the RDA, but well 
below the lowest daily intakes considered to be tox- 
ic.30.35 The total daily doses of the fat-soluble vita- 
mins were 4,500 to 5,000 IU of vitamin A (2,000 to 
2,500 IU from cod liver oil and 2,500 IU as palmi- 
tate); 600 to 700 IU of vitamin D (400 to 500 IU 
from cod liver oil and 200 IU as D3); and 31 IU of 
vitamin E (1 IU from cod liver oil). For these vita- 
mins, the lowest daily intakes considered to be toxic 
are 19,860 IU/d of vitamin A30; 2,800 to 4,000 IU/d 
of vitamin D30; and 600 IU/d of vitamin E (100 times 
the nutritional requirement).35 


The total daily doses of other vitamins were 20 
ug of vitamin K; 60 mg of vitamin C; 0.75 mg of 
thiamine (vitamin B1); 0.85 mg of riboflavin (vita- 
min B2); 2.5 mg of niacin; 1.0 mg of vitamin B6 (py- 
ridoxine); 100 ug of folate (folic acid); 3 ug of vita- 
min Bi2 (cyanocobalamin); 15 ug of biotin; and 5 
mg of pantothenic acid. The total daily doses of min- 
erals were 25 mg of calcium; 4.5 mg of iron; 11 mg 
of phosphorus; 37.5 ug of iodine; 12.5 mg of mag- 
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Comments 





First % Days of Antibiotics for 
Otitis Otitis Media 
Media Before On Compliance 
Sept 1 to Regi Regi m 
Subject Start Age Nov 30, % cannes egimen (%) 
No. Date Sex Race {y) 2000 Response Difference* % Days % Days CLO MVM 
1 Decl, M W 1.3 Yes Yes -21.6 21.6 22/102 0.0 0/110 58.0 83.0 
2000 
2 Decl4, M W 12 No No -0.3 9.6 10/104 9.3 10/108 94.0 93.0 
2000 
3 Decl4, M W 44 No Yes -6.7 67 7/104 0.0 0/108 91.0 91.0 
2000 i 
4 Dec2i, F W 22 Yess Yes -36.0 36.0 30/111 0.0 0/59 81.0 81.0 
2000 
5 Ja9 F W 32 No Yes -15.4 15.4 20/130 0.0 0/82 98.0 83.0 
2001 
6 Jan10, F W 08 Yes N/A N/A 28.0} 37/132t N/A ED — N/A 
2001 
7 Janl0, M W 34 No No 3.1 22.9 30/131 26.0 20/76 97.0 99.0 
2001 
8 Jan6, F Mixed 2.0 Yes Yes -9.4 94 12/127 0.0 0/84 SS None 
2001 


URI during supplementation 

“on”) resolved. On Feb 1, 
mother decreased dose of 
CLO to 1/2 teaspoon per day 
for compliance reasons. 


Brother of subject 3. Initial 
URI during supplementation 
resolved; second URI 
(developed when his mother 
and brother had URIs) 
progressed to AOM. 


Brother of subject 2. On 
entry, status post bilateral 
tympanostomy tubes (Jan 28, 
2000), only R tube intact. 
During supplementation, URI 
with small amount of serous 
drainage from R ear that 
resolved without treatment. 
Pediatric otolaryngologist on 
Mar 16, 2001: persistent L 
middle ear effusion; new tube 
might be necessary. 


Began preschool in Sep 

2000. Bronchitis at start of 
supplementation treated with 
azithromycin for 5 days. URE 
during supplementation 
resolved, Withdrawn on Feb 
17, 2001, when parents read 
Internet Greenpeace article . . 
(see text). 

URIs during supplementation 
resolved. Per mother, this 
was “the first time in two 
years that a cold didn’t go 
into an ear infection or a bad 
chest thing.” Took Animal 
Parade MVM, not study 
MVM. 


Withdrawn after 10 days; 
would not take CLO. Then 
given antibiotic prophylaxis, 
pneumococcal vaccine; day 
care discontinued; persistent 
bilateral mild serous 
effusions. 


_End of study Mar 28, 2001: 


bilateral adenoidectomy and 
tympanostomy tubes. 

SS = Seven Seas CLO, no 
vitamins. URIs during 
supplementation resolved. 


CLO — cod liver oil, MVM — multivitamin-mineral supplement, W — white, URI — upper respiratory tract infection, AOM — acute otitis 


media, ED — early dropout. 
*Mean (+SD) difference = —12.3% + 13.4; p < .05. 
{Before regimen” included 3 injections of ceftriaxone. 





nesium; 3.75 mg of zinc; 17.5 ug of selenium; 0.5 
mg of copper; 0.88 mg of manganese; 30 ug of chro- 
mium; 19 ug of molybdenum; 1.25 mg of potassium; 
and 5 mg of lemon bioflavonoids. 


Efficacy. We examined the time interval from Sep- 
tember 1, 2000, through March 31, 2001. The “be- - 
fore” period was defined as September 1, 2000, to 
the date of study enrollment (Table 3). The “on” or 
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Percent Days of Antibiotics 








BEFORE % ON% 


Percent days of antibiotics. Subjects received antibiotics 
for otitis media for 12.3% + 13.4% (mean + SD) fewer 
days during supplementation (ON%) than before (BE- 
FORE%); p < .05. 


supplementation period was defined as the date of 
enrollment through the end of the study (March 31, 
2001, for all subjects except 4 and 7). Subject 4 last 
received study supplements on February 17, 2001 
(see Dropouts, below); subject 7 last received study 
supplements on March 27, 2001, because he under- 
went adenoidectomy and placement of TTs the fol- 
lowing day. The number of days of antibiotics for 
OM was determined from each subject’s medical rec- 
ords in conjunction with maternal diaries. The per- 
centage of days of antibiotics for OM was calculated 
for the “before” and “on” periods; statistical compari- 
sons were performed with the nonparametric Wilcox- 
on matched pairs test.28.29 


Otitis Media Season. Otitis media is a seasonal 
illness, with a peak incidence in the winter months.>® 
As an overall reflection of the duration of the “OM 
season” during the time period our study was per- 
formed, we evaluated the number of monthly visits 
for OM at the Soho Pediatric Group from Septem- 
ber 1, 2000, through March 31, 2001. The Group pre- 
pared a computer printout of all visits during this 
time period with a diagnosis code of OM (with or 
without perforation); this included visits for AOM, 
as well as those for OM with effusion; visits coded 
as a well child visit, during which a follow-up of 
OM was also performed, were not included; visits 
by study subjects were subtracted from the totals pro- 
vided. The monthly totals for October 2000 through 
March 2001 were compared to the monthly total for 
September 2000 by x? analysis.29 To correct for mul- 
tiple comparisons, we applied the Bonferroni correc- 
tion: & = 0.05 divided by the number of compari- 
sons (6) = corrected & of 0.008.37 


Compliance. The number of days the subjects re- 
ceived study supplements was determined from ma- 


ternal diaries. This number was divided by the num- 
ber of days in the supplementation (“on”) period to 
determine the percent compliance, which was mea- 
sured separately for cod liver oil and the multivita- 
min-mineral supplement (Table 3). 


RESULTS 


Demographics. Eight subjects entered the clinical 
secondary prevention study. Unlike the subjects in 
part 1 of this study, all were private outpatients of 
the Soho Pediatric Group; 7 of the 8 were white, none 
were Hispanic, and half were girls (Table 3). Seven 
took the study nutritional supplements supplied by 
the investigators. After reviewing the study informa- 
tion, and before we were able to arrange for formal 
enrollment in the study, the mother of the eighth sub- 
ject began giving her daughter cod liver oil that she 
purchased on her own. This mother was from the 
English-speaking Caribbean, where the daily admin- 
istration of cod liver oil to young children was re- 
portedly a common practice. This family cooperated 
with other aspects of the study and gave consent to 
allow us to obtain copies of their child’s medical rec- 
ords for inclusion in the study. 


Dropouts. Subject 6 dropped out of the study af- 
ter 10 days because she could not tolerate the taste 
of the cod liver oil, despite repeated attempts by her 
mother to administer small amounts in several dif- 
ferent foods and drinks. Subject 4 received supple- 
ments for 59 days; she dropped out of the study on 
February 17, 2001, after her parents read an Internet 
article by Greenpeace regarding the dangers of fish 
oil contamination with polychlorinated biphenyls 
(PCBs) and dioxins.38 Despite the excellent purity 
data supplied by the Norwegian manufacturer of the 
study cod liver oil (Peter Möller, Oslo, Norway, per- 
sonal communication, 2001) and the subject’s ap- 
parently favorable response, her parents refused to 
continue with the study. This child had received an- 
tibiotics for OM for 36% ofthe days (40/111) in the 
“before” period, before starting study supplements; 
during the “on” or supplementation period, she had 
no episodes of OM, and only 1 episode of bronchitis 
early in this phase, treated with a 5-day course of 
azithromycin (8/59 or 8.5%; Table 3). This subject 
was therefore classified as a responder. We informed 
the NYEE IRB, the Soho Pediatric Group, and the 
other families in the study of this situation; the study 
was continued as originally planned. 


Adverse Reactions. No clinically significant ad- 
verse reactions due to study supplements occurred 
during the study, and there was no evidence of signs 
or symptoms of vitamin A toxicity. 


Percentage of Days of Antibiotics for Otitis Me- 
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dia. All subjects except No. 6 (who dropped out af- 
ter 10 days; see Dropouts, above) were included in 
the efficacy analysis, summarized in Table 3 and the 
Figure. The percentage of days of antibiotics for OM 
was calculated for the “before” and “on” periods as 
described above (Data Analysis). The mean (+SD) 
decrease between the 2 periods was 12.3% + 13.4%, 
(p < .05, Wilcoxon matched pairs test?8:29; N = 7; t= 
2.000; Z = 2.028). The statistical analysis was un- 
changed when subject 4’s treatment with antibiotics 
for bronchitis during the supplementation period (5 
days; 8.5%) was included (Wilcoxon matched pairs 
test). Five of the 7 subjects (Nos. 1, 3, 4, 5, and 8) 
were classified as responders; they had no additional 
ear infections treated with antibiotics while receiv- 
ing study supplements. The parents of each of the 
responders reported that their child had at least 1 up- 
per respiratory tract infection during supplementa- 
tion that resolved without additional treatment. Two 
subjects (Nos. 2 and 7) were classified as nonrespond- 
ers; each had an additional episode of OM during 
the supplementation phase, and the percentage of days 
of antibiotics for OM for each was essentially un- 
changed during supplementation. 


Two subjects (Nos. 3 and 7) had persistent serous 
middle ear effusions that did not resolve during sup- 
plementation, as assessed by their private pediatric 
otolaryngologist. Subject 7 underwent surgery at the 
end of the study for placement of bilateral TTs and 
adenoidectomy. At the study’s end, subject 3’s pri- 
vate pediatric otolaryngologist advised his mother 
that repeat insertion of a TT in his left ear might be 
required because of a persistent serous effusion. 


Otitis Media Season. We evaluated the number of 
monthly visits for OM at the Soho Pediatric Group 
from September 1, 2000, through March 31, 2001 
(see Data Analysis, above). The monthly totals were 
26 for September; 51 for October; 81 for November; 
95 for December; 88 for January; 74 for February; 
and 93 for March. The number of monthly visits for 
each month from October 2000 through March 2001 
was significantly higher than that for September 2000 
(p < .001; 770.001,1 = 10.828; Bonferroni corrected 
o = 0.008 [calculated as & = 0.05 divided by 6 com- 
parisons37]). Thus, the “OM season” was not over 
during the second half of our study. 


DISCUSSION 


The blood level determinations performed in part 
1 of this research revealed that 1) the study children 
had lower levels of RBC EPA than did the ACs; 2) 
consistent with previous reports,3! 69% of the study 
children had vitamin A levels in the lower reference 
range, with some levels in the suboptimal range; and 


3) the study subjects, like other children,?4 had lev- 
els of plasma selenium lower than the published val- 
ues for adults.2> Eicosapentaenoic acid is a precur- 
sor of critical anti-inflammatory eicosanoids?; vita- 
min A is known as the “anti-infective” vitamin!?; and 
selenium is an essential component of glutathione 
peroxidase, one of the key free radical-scavenging 
enzymes.40 We hypothesized that lower levels of 
these important nutrients predispose some children 
to acute OM, and we tested this hypothesis in a small 
clinical study (part 2 of this research). 


In the clinical study, we chose cod liver oil as a 
source of both vitamin A and EPA, and used it in 
conjunction with a marketed children’s chewable 
multivitamin-mineral preparation that contains se- 
lenium. We specifically chose cod liver oil as a source 
of vitamin A on the basis of our blood level data, 
which revealed no significant difference in the vita- 
min A levels between the subgroups of children who 
were taking vitamin supplements and those who were 
not. We could have used fish oil as the source of EPA, 
but fish oil does not contain vitamin A or D. 


In our clinical study of 8 children, 1 subject could 
not tolerate the taste of cod liver oil. The remaining 
7 children received antibiotics for OM for a mean 
(SD) of 12.3% + 13.4% fewer days during supple- 
mentation than before, and 5 of the 7 children experi- 
enced no additional ear infections during supplemen- 
tation. These improvements occurred despite the fact 
that the “OM season” was not over during the sec- 
ond half of our study, and the responders continued 
to experience upper respiratory tract infections dur- 
ing this time. However, because our study lacked a 
randomized, parallel control group, we cannot exclude 
the possibility that the decreased antibiotic usage we 
found might have occurred without the use of study 
supplements. Although children were required to have 
had at least 1 episode of OM during the OM season 
under investigation, our pilot study was not limited 
to the subgroup of children with repeated episodes 
of OM. The results of this initial study require con- 
firmation in larger controlled studies, in which the 
issue of subgroup response could also be addressed. 
The study supplements had no apparent benefit on 
established serous middle ear effusions in 2 children. 
In future studies, we therefore propose that supple- 
mentation begin as soon as possible after a child’s 
first episode of OM, although a decreased dosage 
may be required for compliance reasons in younger 
infants. 


Issues of Study Design. Large (adequately pow- 
ered), randomized, placebo-controlled, parallel-group 
studies are the “gold standard” for clinical trials. We 
performed a small, open-label clinical study that was 
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not randomized. We did this because matched place- 
bos for our study supplements were not available; 
the parents were not willing to accept randomization 
to a “no supplement” group, and we could not have 
enforced randomization to the latter, because the nu- 
tritional supplements under evaluation could be read- 
ily purchased in a health food store. 


Matched placebos for study supplements would 
facilitate compliance with randomization. Formulat- 
ing a matched placebo for a multivitamin-mineral 
supplement is a practical consideration. However, be- 
cause of its distinctive odor and taste, formulating a 
matched placebo for liquid cod liver oil would be 
very difficult (Susan Carlson, RPh, personal commu- 
nication, 2000). Adults and older children can swal- 
low capsules; infants and toddlers cannot. Further- 
more, if capsules were cut open to administer the 
contents, the distinctive odor and taste of cod liver 
oil would be immediately apparent. 


Watchful Waiting and Acute Otitis Media. Studies 
from the Netherlands*! and the United Kingdom* 
support the use of a period of “watchful waiting” for 
most young children initially presenting to a general 
practitioner with AOM, rather than the immediate 
use of antibiotics. We were not able to find data on 
cod liver oil consumption in these reports. In future 
studies, it would be useful to determine whether the 
use of study supplements could facilitate the use of 
“watchful waiting” in the United States. 


Omega Fatty Acids. Fatty acids containing more 
than 1 double bond are classified as polyunsaturated 
fatty acids (PUFAs). There are 2 series of PUFAs, 
the omega-3 and omega-6 fatty acids, named for the 
distance of the first double bond from the methyl end 
of the molecule.*3 These 2 series are not interconver- 
tible in the human body; they are metabolically and 
functionally distinct, compete for the same desatura- 
tion enzymes, and have opposing physiological func- 
tions.43 Their proper balance is important for homeo- 
stasis and normal development.*3 


Eicosapentaenoic acid and DHA are 2 important 
omega-3 fatty acids. The brain and retina are rich in 
DHA,>4 which is necessary for normal visual and 
neural development.33 Eicosapentaenoic acid is an 
important precursor of anti-inflammatory eicosa- 
noids, while the omega-6 fatty acids (especially arach- 
idonic acid) are precursors of inflammatory eicosa- 
noids.3? In view of the role of inflammation in the 
pathogenesis of OM discussed above (Introduction), 
we believe that the lower levels of EPA in children 
reported herein are particularly relevant. Further- 
more, the level of omega-3 fatty acids and other 
PUFAs is largely determined by diet, unlike the lev- 


els of proteins, whose structure is genetically deter- 
mined.4344 Omega-3 fatty acid levels can be in- 
creased by eating more fatty fish and by consuming 
nutritional supplements such as cod liver oil or fish 
oil. 


Vitamin A and Cod Liver Oil. Vitamin A, called 
the “anti-infective” vitamin, plays a role in modulat- 
ing a wide spectrum of immune responses.!2 World- 
wide, the rate of mortality is high among children 
with severe vitamin A deficiency and xerophthalmia 
(corneal ulceration and keratomalacia, or dry and in- 
sensitive eyes), with in-hospital death rates averag- 
ing 15% to 25% in such children. In developing coun- 
tries, even milder vitamin A deficiency (associated 
with night blindness and Bitot’s spots) is associated 
with a marked increase in mortality.45 


None of our subjects had symptoms of night blind- 
ness, and none of the subjects in the blood level study 
had a vitamin A level in the deficient range. How- 
ever, consistent with previous research,>! 69% of the 
children in the blood level study (27/39) had vita- 
min A levels of <40 g/dL, the level that predicts 
normal dark adaptation 95% of the time.32 The over- 
representation of Hispanic children in the 15% of our 
subjects (6/39) with suboptimal vitamin A levels is 
also consistent with previous reports.?! 


Cod liver oil contains vitamins A and D in addi- 
tion to the omega-3 fatty acids EPA and DHA. From 
the 1920s to the 1940s, many children in the United 
States were given a teaspoon of cod liver oil each day 
as a source of vitamins A and D,® and orange juice 
as a source of vitamin C. However, older prepara- 
tions of cod liver oil had an unpleasant taste, and 
professionals were concerned about vitamin A toxic- 
ity* and lipoid aspiration pneumonia.‘ In the United 
States, cod liver oil has been largely replaced by syn- 
thetic vitamin preparations; however, the latter do not 
contain omega-3 fatty acids.*3 


In contrast to the situation in the United States, 
the Norwegian Nutrition Council recommends the 
consumption of 1 teaspoon of cod liver oil per day, 
beginning at 4 weeks of age. The primary purpose 
of this recommendation is the prevention of rickets 
in persons with limited exposure to sunlight; cod liver 
oil is recommended because it provides essential fatty 
acids in addition to vitamin D.48 


Polychlorinated Biphenyls and Dioxins. One of 
our responders (subject 4) dropped out of our study 
after her parents read an Internet article by Green- 
peace regarding the dangers of fish oil contamina- 
tion with PCBs and dioxins.*8 


The amount of PCB and dioxin residues in cod 
liver oil and fish oil varies with the product exam- 
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ined,‘ and there is no uniform standard for a TDI 
(Tolerable Daily Intake) for these substances.>° Us- 
ing data provided by the manufacturer of the cod liver 
oil used in this study (Peter Möller, Oslo, Norway, 
personal communication, 2001), we calculated that 
a 10-kg child taking 1 teaspoon per day of this cod 
liver oil would receive 0.5 pg of World Health Orga- 
nization toxic equivalents (WHO-TEQ) per kilogram 
of body weight per day. This figure is less than the 
United Kingdom TDI for dioxins and dioxinlike 
PCBs of 10 pg WHO-TEQ per kilogram of body 
weight per day. It is also below the TDI proposed by 
the WHO of 1 to 4 pg WHO-TEQ per kilogram of 
body weight per day; the latter has not yet been en- 
dorsed in the United Kingdom.>° 


Selenium. Selenium is a structural component of 
the active center of glutathione peroxidase,*° one of 
the key free radical-scavenging enzymes that pro- 
tect against oxidative stress. In an animal model, the 
ingestion of high doses of omega-3 fatty acids in- 
creased the susceptibility of cell membranes to oxi- 





dative stress.°! We therefore consider it prudent to 
bolster the free radical—scavenging enzyme system 
when supplementing children with cod liver oil, al- 
though this issue requires further evaluation. 


CONCLUSION 


We propose that the lower levels of EPA and sele- 
nium in children, when combined with vitamin A lev- 
els in the lower reference range, predispose some 
children to OM. We believe that larger, controlled 
trials are warranted to assess the utility of cod liver 
oil (of acceptable purity and taste) and a children’s 
multivitamin-mineral preparation containing sele- 
nium, for the prevention of OM in infants and young 
children. The effect of supplementation on parental 
acceptance of delayed prescription of antibiotics for 
this disorder also deserves scrutiny. However, the de- 
sign and performance of future studies will continue 
to be hampered by difficulties in formulating a pla- 
cebo for liquid cod liver oil, and by the fact that these 
supplements can be readily purchased in a health food 
store. 
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The aim of this study was to evaluate the perilymphatic pressure, by means of the MMS-10 Tympanic Displacement Analyzer 
(TDA), in patients with Meniere’s disease (MD). Measurements were performed in 37 patients with MD and in 14 normal-hearing 
subjects. Data were collected from 3 groups: healthy ears of normal-hearing subjects, unaffected ears of patients with MD, and 
affected ears of patients with MD. Analysis of the results obtained with the TDA showed no significant differences between the 3 
groups. Several hypotheses could explain this lack of difference: 1) perhaps indirect measurement of perilymphatic pressure with the 
TDA is not relevant; 2) perhaps hyperpressure of the inner ear is not the physiological basis for the clinical triad of MD; or 3) perhaps 
endolymphatic hydrops does not produce changes in perilymphatic pressure. The results of this series indicate that the TDA is not 
useful in the evaluation of patients with MD. 


KEY WORDS — inner ear pressure, Meniere’s disease, perilymphatic fluid. 
INTRODUCTION 
The MMS-10 Tympanic Displacement Analyzer 


fluid pressure. The present study was conducted in 
order to evaluate the results of the TDA in patients 


(TDA) is a new device that was introduced as a non- 
invasive method to evaluate perilymphatic and/or in- 
tracranial pressure changes.! This system was first de- 
veloped to investigate intracranial pressure changes, 


with Meniere’s disease (MD). 


The TDA is based on the following principles.!3 
The TDA can measure small variations in volume 
displacement in the external auditory canal, produced 


with particular interest in the survey of children pre- 
senting with intracranial hypertension syndrome 
treated by ventricular shunting.” 


by movements of the tympanic membrane, induced. 
by the stapedial reflex. The movements of the tym- 
panic membrane during stapedial reflex contraction 
depend on the resting position of the stapes footplate 
in the oval window, which is directly related to the 


The TDA may also provide an indirect and nonin- 
vasive method for the assessment of the inner ear 
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Fig 1. Different types of curves that may be re- Type 1 Type 2 
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Fig 2. Quantitative parameters (Vi and Vm) measured 
with Tympanic Displacement Analyzer. 


perilymphatic pressure. A high perilymphatic pres- 
sure displaces the stapes footplate laterally. On con- 
traction of the stapedius muscle, the footplate will 
move medially, resulting in an inward displacement 
of the tympanic membrane. The representation of this 
movement (indicating a high perilymphatic pressure) 
is a negative curve. Conversely, for normal or low 
perilymphatic pressure, the footplate rests more me- 
dially. On stapedial reflex contraction, the footplate 
moves laterally, resulting in an outward displacement 
of the tympanic membrane. The representation of this 
displacement (indicating normal or low perilymphat- 
ic pressure) is a positive or biphasic curve. 


The perilymphatic pressure is directly related to 
intracranial pressure. The major connection route be- 
tween the perilymphatic compartment and the sub- 
arachnoid space is the cochlear aqueduct. The pa- 
tency of the cochlear aqueduct seems to decrease with 
age.° Intracranial pressure is higher in the supine po- 
sition than in the sitting position. This difference will 
be reflected on tympanic membrane displacement if 
the cochlear aqueduct is patent. In contrast, if the 
cochlear aqueduct is not patent, no difference will 
be observed in tympanic membrane displacement be- 
tween the sitting and supine positions. 


MATERIAL AND METHODS 


In this study, 37 patients with MD (21 women and 
16 men; mean age, 47 years; range, 18 to 77 years) 
and 14 normal-hearing subjects (8 women and 6 men; 
mean age, 44 years; range, 18 to 55 years) were en- 
rolled from May 1997 through December 1998. A 
diagnosis of MD was found according to the criteria 
of the American Academy of Otolaryngology—Head 
and Neck Surgery®; 23 patients were classified as 


having definite MD, and 14 patients were classified 
as having probable MD. All patients had had MD 
for at least 6 months. Thirty-four patients presented 
with unilateral MD, and 3 with bilateral MD. In all 
patients, the diagnostic protocol consisted of a routine 
otolaryngological examination, audiometric tests, 
vestibular tests, auditory brain stem response audiom- 
etry, and/or magnetic resonance imaging of the cere- 
bellopontine angle. The normal-hearing subjects pre- 
sented no otologic or neurotologic history and had 
normal findings on standard audiometric examina- 
tion. : 


The ears included in this study were divided into 
3 different groups: group 1, control ears of normal- 
hearing subjects (n = 28); group 2, nonaffected ears 
of MD patients (n = 34); and group 3, affected ears 
of MD patients (n = 40). 


All of the tests were performed by the same opera- 
tor, using the MMS-10 TDA (Marchbanks Measure- 
ment Systems Ltd, Lymington, England). Assessment 
of middle ear function was accomplished before test- 
ing (normal tympanometry; measurement of ipsilat- 
eral acoustic reflex threshold at 1 kHz). A probe at- 
tached to a headset sealed the external auditory canal 
and was connected to a personal computer. The dura- 
tion of the stimulus was 500 ms at 1 kHz. The stim- 
ulation intensity was set at 15 dB or more above the 
stapedial reflex threshold at this same frequency. 


Qualitative assessment of the results may reveal 5 
types of curves (Fig 1): type 1, a positive curve; type 
2, a biphasic curve with a small inward displacement 
followed by a large outward movement; type 3, a neg- 
ative curve that has an initial significant negative- 
going curve followed by an outward displacement, 
but does not become positive before the end of the 
stimulus; type 4, a typical negative curve; and type 
5, a poor-quality recording that does not lead to any 
contributive conclusion. Type 1 and 2 curves reflect 
a normal or low perilymphatic pressure. Type 3 and 
4 curves reflect a high perilymphatic pressure. 


Two parameters, Vi and Vm, are used to quantify 
volume variations in the external auditory canal in- 
duced by the displacement of the tympanic membrane 
after stapedial contraction. Vi is the maximum inward 
displacement of the tympanic membrane during 


TABLE 1. QUALITATIVE RESULTS OF TDA MEASUREMENTS FOR THREE GROUPS IN SITTING POSITION 








Type 1 Type 2 
Group 1 (N = 28) 16 (57.1%) 
Group 2 (N = 34) 17 (50.0%) 
Group 3 (N = 40) 19 (47.5%) 


See text for description of groups and curve types. 
TDA — Tympanic Displacement Analyzer. 


7 (25.0%) 
7 (20.6%) 
6 (15.0%) 





Type 3 Type 4 Type 5 
2 (7.1%) 2 (7.1%) 1 (3.5%) 
3 (8.8%) 4 (11.8%) 3 (8.8%) 
3 (7.5%) 8 (20.0%) 4 (10.0%) 
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TABLE 2. QUALITATIVE RESULTS OF TDA MEASUREMENTS FOR THREE GROUPS IN SUPINE POSITION 














Type 1 Type 2 
Group 1 (N = 28) 9 (32.1%) 8 (28.6%) 
Group 2 (N = 34) 15 (44.1%) 6 (17.6%) 
Group 3 (N = 40) 16 (40.0%) 6 (15.0%) 








Type 3 Type 4 Type 5 
3 (10.7%) 4 (14.3%) 4 (14.3%) 
3 (8.8%) 6 (17.6%) 4 (11.8%) 
4 (10.0%) 10 (25.0%) 4 (10.0%) 





acoustic stimulation (Fig 2). Vm is the mean displace- 
ment between the moment at which Vi is reached 
and the end of the acoustic stimulus (Fig 2). As they 
measure volume variations, Vi and Vm are both ex- 
pressed in nanoliters and are proposed to be related 
to perilymphatic pressure. Large negative values of 
Vi and Vm reflect high perilymphatic pressure, 
whereas Vi values near zero and positive Vm values 
reflect normal or low perilymphatic pressure. 


To assess cochlear aqueduct patency, we used the 
ratio dVi/Vi proposed by Phillips and Marchbanks.’ 
The difference between Vi values obtained with the 
subject sitting and those obtained with the subject 
supine is divided by the Vi value obtained with the 
subject supine (dVi/Vi). The aqueduct is considered 
patent if dVi/Vi > 0.1. 


In all ears, the TDA measurements were done in 
` sitting and supine positions. The results were assessed 
both qualitatively (type of curve) and quantitatively 
(Vi and Vm). Cochlear aqueduct patency was calcu- 
lated with the formula dVi/Vi. The results were ana- 
lyzed with the Fisher exact test. Results with’a p value 
of less than .0S were considered to be statistically 
significant. 
RESULTS 


The types.of curves recorded in each group are 
presented in Tables 1 and 2. No statistically signifi- 
cant difference (p > .05) was found between the 3 
groups in comparing the curves reflecting high peri- 
lymphatic pressure (types 3 and 4) with the curves 
reflecting normal or low perilymphatic pressure 
(types 1 and 2). 


As shown in Table 3, we found large interindivid- 


ual differences for the measured Vi and Vm variables. 
The calculations performed for cochlear aqueduct pa- 
tency (dVi/Vi) in 28 ears are presented in Table 4. 
No difference was found in cochlear aqueduct pa- 
tency between the different groups. 


Measurements were done during an attack of MD 
in 23 ears and during a remission of the disease in 17 
ears. The qualitative results are reported in Table 5. 
There was no statistically significant difference be- 
tween these 2 subgroups. 


DISCUSSION 


The TDA is a new device that provides a noninva- 
sive method for measurement of perilymphatic pres- 
sure. Because of the relationships between cerebro- 
spinal fluid pressure and perilymphatic pressure, the 
TDA was first used to assess intracranial pressure 
changes, especially in children with shunted hydro- 
cephalus.” 


According to Marchbanks,® the TDA detects high 
perilymphatic pressure resulting from intracranial hy- 
pertension with a sensitivity of 95% and a specificity 
of 74%. Thus, normal intracranial pressure shows a 
type 3 or type 4 response in 26% of cases. Conversely, 
5% of patients with raised intracranial pressure show 
a type 1 or type 2 response. 


By measuring cochlear pressure changes, the TDA 
seems to be a promising tool to investigate some hear- 
ing and balance disorders. !.9-!0 The otologic manifes- 
tations of the X-linked progressive mixed deafness 
syndrome with perilymphatic gusher are related to 
high perilymphatic pressure. In such cases, stapes 
surgery can result in severe hearing loss. Ernst et al!! 
reported the helpful application of the TDA in the 





TABLE 3. QUANTITATIVE MEASUREMENTS OF TDA IN THREE GROUPS 











Vi (nL) 
Group 1 (N = 28) Mean —31.0 
Range -319 to 0 
SD 70.5 
Group 2 (N = 34) Mean -53.6 
Range —613 to 0 
SD 122.8 
Group 3 (N = 40) Mean —48.3 
Range -324 to 0 
SD 82.8 


See text for definitions of Vi and Vm. 














Sitting 
Vm (nL) Vi (nL) Vm (nL) 

100.8 ~46.9 66.9 

—264 to 386 —328 to 0 —330 to 409 
125.4 84.4 162.9 
27.3 -72.7 -6.1 

—613 to 475 —427 to 0 —409 to 286 
196.9 118.3 146.9 
42.4 -84.5 -2.0 

—320 to 389 —396 to 0 —365 to 335 
154.1 118.5 164.7 
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TABLE 4. RESULTS OF COCHLEAR AQUEDUCT 





PATENCY CALCULATIONS 
aVilVi > 0.1 dVilVi < 0.1 
Group 1 (N = 14) 9 (64.3%) 5 (35.7%) 
Group 2 (N = 18) 15 (83.3%) 3 (16.7%) 
Group 3 (N = 21) 16 (76.2%) 5 (23.8%) 


See text for definition of dVi/Vi. 








prevention of complications of stapes surgery in peri- 
lymphatic gusher by preoperative detection of high 
intralabyrinthine pressure. Audiovestibular manifes- 
tations of intracranial hypertension were well de- 
scribed by Sismanis,!2 who reported the high prev- 
alence of tinnitus, dizziness, vertigo, aural fullness, 
and hearing loss in benign intracranial hypertension 
syndrome (BIH). This prevalence was also estab- 
lished by Marchbanks,® who reported that BIH pre- 
sented as a Meniere’s-like disease. He advocated the 
use of the TDA in MD in order to detect patients 
with audiological symptoms who have intracranial 
hypertension as an underlying cause. 


In a series of 25 patients with MD, Bouccara et 
al,!3 using the TDA, described 2 subgroups based 
on normal versus high perilymphatic pressure. In con- 
trast with these results, Rosingh et al!415 and Gose- 
path et al! did not find any differences between pa- 
tients with MD and normal-hearing subjects when 
assessing perilymphatic pressure with TDA measure- 
ments. The analysis of the results of our series has 





TABLE 5. QUALITATIVE RESULTS OF TDA 
MEASUREMENTS IN AFFECTED EARS WHEN 
PERFORMED DURING ATTACK OR DURING 











REMISSION PERIOD 
Types Types 
l and2 3and4 Type 5 
(sitting/ (sitting!  (sitting/ 
Affected Ears (N = 40) supine) supine) supine) 
Attack period (N = 23) 13/14 8/7 2/2 
Remission period (N = 17) 12/8 3/7 2/2 





led us to the same conclusions as Rosingh et al and 
Gosepath et al. With a similar methodological ap- 
proach, in our series comparing 37 patients with MD 
and 9 normal-hearing subjects, we also did not find 
any difference between affected ears, unaffected ears, 
and control ears. Moreover, we did not detect any dif- 
ference between the 3 groups when assessing co- 
chlear aqueduct patency with TDA. 


In our experience, the TDA did not provide rele- 
vant information on patients with MD. Our findings 
could be explained in various ways: 1) the technique 
may not be adapted; 2) variation in labyrinthine hy- 
drodynamics may not be the only physiopathological 
concept to support the clinical triad of MD; or 3) 
endolymphatic hydrops may not produce changes in 
perilymphatic pressure. Further studies are needed 
to clarify these questions. In any case, we conclude 
that the results of this series show that TDA does not 
seem to be useful in the evaluation of MD. 
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IMAGING CASE STUDY OF THE MONTH 


ISOLATED CONGENITAL AGENESIS OF THE OLFACTORY BULBS AND 
TRACTS IN A CHILD WITHOUT KALLMANN’S SYNDROME 


LINO DI RIENZO, MD 
ROME, ITALY 


ALBERTO ARTUSO, MD 
ROME, ITALY 


INTRODUCTION 


Olfactory dysfunctions are rarely found in young 
children, and in most cases are the consequence of 
chronic nasal obstruction with or without a related 
infection. One or more disorders such as hypertrophy 
of the adenoids, hypertrophy of the inferior and mid- 
dle turbinates, nasal polyps, deviation of the nasal 
septum, and juvenile angiofibroma are the principal 
causes of hyposmia and, more rarely, of anosmia in 
small children. Rhinitis and adenoiditis (allergic, in- 


fective, or irritative) are often the cause of olfactory. 


dysfunctions such as hyposmia and parosmia. Olfac- 
tory dysfunctions in small children are more rarely 
caused by craniofacial trauma, endocrine disorders 
(Addison’s disease, dysfunctions of the thyroid), and 
brain tumors. The most frequent cause of congenital 
anosmia is associated with Kallmann’s syndrome, a 
hereditary, X-linked, genetic-olfactory dysplasia 
characterized by the phenotypic link between hypo- 
gonadotropic hypogonadism and anosmia, affecting 
about 1 in 10,000 men and 1 in 50,000 women (KAL- 
1, the gene that encodes the glycoprotein anosmin- 
1, which is a component of the extracellular matrix 
during organogenesis and is responsible for the syn- 
drome, has been identified).!+ The anosmia is caused 
by the absence of olfactory bulbs and tracts, whereas 
the hypogonadism is of hypothalamic origin and is 
caused by gonadotropin-releasing hormone (Gn-RH) 
- deficiency.2> The link between anosmia and hypo- 
- gonadism is explained by the common origin in the 
embryological development between the olfactory 
neurons and Gn-RH secreting neurons, which are ar- 
rested in their migration.2° In some patients with 
Kallmann’s syndrome, signs of other malformations 
(mainly, midline defects) and of synkinesis, abnormal 
eye movement, cerebellar dysfunction, nystagmus, 
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cavus foot, unilateral renal agenesis, cleft palate, 


‘cryptorchidism, osteopenia, femur-fibula-ulna dys- 


ostosis, obesity, sensorineural hearing loss, and car- 
diac abnormalities can be found.7-10 Other congenital 
diseases, including Down syndrome, Turner’s syn- 
drome, Riley-Day syndrome, and Pena-Shokeir syn- - 
drome, are associated with olfactory dysfunction. t112 
On the other hand, until now, complete absence of 
olfactory bulbs and tracts has been found only in Kall- 
mann’s syndrome, in holoprosencephaly, and in ar- 
rhinencephalia.!3 A rare variation of Kallmann’s syn- 
drome, in which a partial gonadotropin deficiency 
(fertile eunuch) accompanied by anosmia is present, 
has also been described. !4 


The more common magnetic resonance imaging 
(MRI) appearances of Kallmann’s syndrome have 
been described in a few articles,!5-!8 with emphasis 
on the significance of thin-section Tl-weighted im- 
ages to evaluate the status of the olfactory bulbs and 
tracts, as well as the detailed anatomy of the fronto- 
basal gyri and sulci. 


We report a case of isolated agenesis of the olfac- 
tory bulbs and tracts unrelated to a pathological, syn- 
dromic, or any other kind of pattern described to date. 


CASE REPORT 


A 4-year-old girl was referred to us because the 
girl herself complained of having absolutely no sense 
of smell. Correct nasal respiration was found, and 
the child’s general state of health was good. The med- 
ical history of the family and of the pregnancy and 
the early pathological history of the child revealed 
nothing worthy of note, and the child’s general mental 
and physical development was normal. Rhinoscopy 
with rigid and flexible optic fibers did not reveal any 
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Brain magnetic resonance imaging. A) Sagittal 3-mm-thick spin echo Tl-weighted image. B) Coronal 3-mm-thick spin 
echo T1-weighted image. C,D) High-resolution coronal 0.6-mm-thick T2-weighted image obtained with 3-dimensional 
turbo spin echo sequence. No olfactory bulbs and tracts are visible. On right side, thin isointense band (arrow in C) may 
represent extremely hypoplastic olfactory remnant. More posteriorly in ipsilateral basal frontal lobe, hypoplastic olfac- 
tory sulcus is identified (arrow in D). Surface of contralateral basal frontal lobe is flat, because of complete lack of 
olfactory sulcus. Note superior anatomic details of thin coronal 3-dimensional turbo spin echo T2-weighted images as 
compared with conventional coronal spin echo T1-weighted image. Pituitary gland and hypothalamus are completely 


normal (A). 


structural alteration of the nasal fossae, the nose, or 


the pharynx, and the roof of the nasal cavity appeared 
normal on examination. Anterior rhinomanometry 
showed no nasal obstruction, and the unilateral nasal 
resistance was normal for the age of the patient (0.72 
Pa/cm? per second on the right and 0.65 Pa/cm? per 
second on the left). A smell test confirmed the pres- 
ence of anosmia to the principal smell stimuli, and 
smell stimulation with vinegar and ammonia, which 
usually produces a nasal trigeminal response, proved 
negative. Therefore, brain MRI was performed, spe- 


cifically devoted to a detailed evaluation of the ante- 
rior cranial base and the olfactory region (see Figure). 
By means of a 1.5-T machine, multiplanar 2-dimen- 
sional spin echo and turbo spin echo 3-mm-thick T1- 
and T2-weighted sequences were obtained, followed 
by 3-dimensional (3-D) high-resolution coronal turbo 
spin echo T2-weighted images (512 x 512 matrix, 
0.6-mm-thick partitions). The MRI revealed a virtu- 
ally complete absence of olfactory bulbs and tracts 
bilaterally. On the right side, a slight linear image 
above the extreme anterior portion of the fovea eth- 
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moidalis could be seen as representing an extremely 
hypoplastic olfactory residual. Also on the right, a 
rudimentary olfactory sulcus was recognized, which 
was totally absent on the left. Therefore, the orbital 
gyri and the gyrus rectus could not be reliably sepa- 
rated from each other. The findings on brain examina- 
tion were completely normal, ruling out ventricular 
and subarachnoid space abnormalities; above all, 
there were no diencephalic or pituitary alterations. 
On the basis of the above-described MRI findings, 
the patient underwent a serologic gonadotropin level 
evaluation to exclude subclinical hypogonadism. Her 
serum gonadotropin levels were within normal limits. 


DISCUSSION 


Childhood olfactory dysfunctions can often de- 
ceive even an otolaryngology specialist and conse- 
quently be underestimated as a trivial disease or as a 
way for the child to draw attention to himself or her- 
self. For this reason, we maintain that it is of the 
utmost importance never to underestimate symptoms 
reported by a child. An otolaryngological evaluation 
to exclude the more trivial disorders with routine oto- 
laryngological examinations must be carried out; oth- 
erwise, as in our case, the decision must be made to 
perform a closer examination with more sophisticated 
techniques, which enabled us to make the diagnosis 
in the present case. In fact, all too often a generalist 
or otolaryngology specialist prescribes empirical ther- 
apies to recover the sense of smell that turn out to be 
unnecessary or even damaging to the child. Magnet- 


ic resonance imaging provides the standard structural 
evaluation of the brain and, above all, provides the 
best anatomic depiction of all of the structures poten- 
tially involved in the pathogenesis of hyposmia or 
anosmia.!5-!7.!9 Hence, if the anosmia persists despite 
normal findings on intranasal examination, it is nec- 
essary to continue the diagnostic workup with MRI. 
However, it is important to stress that in such an in- 
stance, a tailored, detailed technique must be used. 
In particular, it is essential to integrate the basic brain 
study with 2- to 3-mm-thick sagittal and coronal T1- 
weighted images and with 3-D high-resolution coro- 
nally acquired turbo spin echo T2-weighted images. 
The combination of thin-slice T1-weighted images 
with submillimeter 3-D T2-weighted images makes 
amillimeter evaluation of the olfactory structures pos- 
sible, giving a precise anatomic balance and avoid- 
ing errors caused by inadequate thickness of the slices 


(partial volume effect). 


Our case is the first reported example in the medi- 
cal literature of an isolated absence of olfactory bulbs 
and tracts in an otherwise perfectly healthy child with 
no related cerebral abnormalities or other congenital 
malformations. Thus, normal serum gonadotropin 
levels are not sufficient to rule out olfactory system 
anomalies, as this approach is suitable only in terms 
of Kallmann’s syndrome. Our experience demon- 
strates the need for a thorough MRI evaluation of 
the olfactory structures in all anosmic children with 
normal intranasal examination findings. 
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The Zeitels Universal Modular (UM) Glottiscope System 





The Evolution of a Century of Design 


The recently patented UM glottiscope system was conceived from the study of a century of direct laryngoscope 
designs to incorporate the most valuable innovations into a uniquely amalgamated instrument. This glottiscope 
system provides the clinician with a versatile laryngoscope that optimally exposes vocal folds for diagnosis and 
instrumental manipulation, regardless of the diversity of human anatomical factors: age, gender, and pathology. 
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Design Features (US patent 5,893,830; international patents pending) 


e The distal lumen of the UM glottiscope is a triangular lancet-arch configuration that distracts the false vocal 
cords and conforms to the anterior glottal commissure. 

e Unlike virtually all microscope-compatible tubular laryngoscopes, which widen the proximal aperture to 
facilitate angulation of hand instruments, the UM glottiscope has bilateral proximal slots that dramatically 
improve the positioning of instrumentation. 

e The UM glottiscope is supplied with a variety of examining tubes which accommodate to the spectrum of 
human size and shape, irrespective of gender, age, or disease, and that attach to a single universal handle. 

e The universal, ergodynamically designed titanium handle can be joined with a suspension gallows as well as 
American and European chest-support holders. 

e Thedetachable base-plate is ideally suited for difficult intubations; lighting and suction cannulae are contained 
within the base-plate. 


ENDOCRAFT® LLC 





— Evolutionary Designs for Laryngeal Instrumentation — 
WEB SITE—http://www.endocraft.com 
Phone: (401) 454-7267(SCOP) Fax: (401) 331-2742 E-mail: endocraft1@aol.com 
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HEPATOCYTE GROWTH FACTOR AND ITS RECEPTOR C-MET IN RAT 


AND RABBIT VOCAL FOLDS 
SHIGERU HIRANO, MD 
i MADISON, WISCONSIN 
SUSAN THIBEAULT, PHD DIANE M. BLESS, PHD 
MADISON, WISCONSIN MADISON, WISCONSIN 
CHARLES N. FORD, MD SHIN-ICHI KANEMARU, MD 
MADISON, WISCONSIN KYOTO, JAPAN 


Vocal fold fibrotic scar is characterized by fibrosis of the lamina propria and epithelium, and is difficult to treat. Hepatocyte 
growth factor (HGF) has antifibrotic activity and has received attention as a possible therapeutic alternative to treat fibrosis. In this 
study, in order to clarify whether HGF can be involved in vocal fold scarring, we examined the existence of HGF and its receptor, 
c-Met, in rat vocal folds, and then the activity of HGF in rabbit injured vocal folds, using immunohistochemistry and enzyme-linked 
immunosorbent assay. We found HGF and c-Met on epithelial cells and gland cells of the rat vocal folds. On the injured vocal folds 
of rabbits, little HGF was observed immediately after injury, but prominent activity occurred simultaneously with reepithelialization 
of the vocal fold mucosa on days 10 to 15. The activity of HGF was observed on fibroblasts in the lamina propria, as well as the 
epithelium. It is suggested that HGF in the vocal folds is produced by the fibroblasts and delivered to the epithelium. The implication 
of these findings is that HGF is involved in wound healing of the vocal fold, and may provide an alternative approach in preventing 


and treating vocal fold scarring. 


‘KEY WORDS — c-Met, hepatocyte growth factor, immunohistochemistry, vocal fold, wound healing. 


INTRODUCTION 


Vocal fold fibrotic scar occurs from fibrosis fol- 
lowing chronic inflammation or injury. The fibrotic, 
or scarred, tissue disrupts the normal relationship be- 
tween the histologic layers of the vocal fold, chang- 
ing the biomechanical properties and resulting in dys- 
phonia. Although several surgical interventions have 
been employed, none has evolved as an optimum 
means for scar resolution. Failure to find a success- 
ful technique may be due to the approaches em- 
ployed. Historically, surgical techniques involved 
surgical excision of the fibrotic tissue. More recently, 
implants or injections of materials thought to reduce 
the effects of scarring have been used with various 
degrees of success.! Behavioral management tech- 
niques have also failed to yield normal voices in many 
cases of vocal fold scarring. These failures suggest 
that alternative approaches are needed. 


One possible alternative to traditional surgery and 
behavioral management of scarring is to prevent fibro- 
sis and scar formation. Studies of hepatocyte growth 





factor (HGF) in other parts of the body have sug- 
gested it has regenerative characteristics that work 
on wound healing as an antifibrotic factor prevent- 
ing fibrosis and scar formation.” If these same char- 


acteristics were seen in the vocal folds, the search 


for new surgical or behavioral management tech- 
niques might not be necessary. 


Unfortunately, wound healing is not well under- 
stood in the vocal fold. During wound healing of the 
vocal fold, fibroblasts play an important role. It has 
been suggested that fibroblasts produce various ex- 
tracellular matrix (ECM) components, including col- 
lagen, elastin, glycosaminoglycan, and fibronectin.3 
However, it is unknown how the fibroblasts produce 
the ECM after injury, how scars are created, or why 
some wounds heal without scars. In all likelihood, 
there are many factors affecting the manner of wound 
healing. For example, HGF prevents fibrosis as men- 
tioned, and is considered critical in prevention of scar 
formation. However, the contribution of HGF in the 
vocal fold has not been identified. 
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Hepatocyte growth factor is a potent mitogen for 
mature hepatocytes‘ that is a multifunctional poly- 
peptide that has been involved in embryogenesis, an- 
giogenesis, and tissue regeneration? and is identical 
to scatter factor and fibroblast-derived epithelial 
growth factor.>-° It is mainly synthesized in mesen- 
chymal cells, including fibroblasts,® Ito cells. Kupffer 
cells, and endothelial cells,? with the target cells be- 
ing epithelial and endothelial cells. The membrane- 
spanning tyrosine kinase c-Met is the receptor of 
HGF.’ The fact that c-Met has been reported to be 
localized in epithelial cells and endothelial cells sug- 
gests that the coupling between HGF and c-Met in- 
tegrates various biological processes in epithelial- 
mesenchymal interaction.” 


The purpose of the present study was to determine 
in vocal folds the localization of HGF and c-Met and 
the activities of HGF during acute stages of wound 
healing. 


MATERIALS AND METHODS 


All methods performed in this study were per- 
formed in accordance with the Public Health Ser- 
vice Policy on Humane Care and Use of Laboratory 


Animals, the NIH Guide for the Care and Use of 


Laboratory Animals, and the Animal Welfare Act. 
The Institutional Animal Care and Use Committee 
of the University of Wisconsin approved the animal 
use protocol. 


Rat HGF and c-Met Localization Study. Seven 
Sprague-Dawley rats weighing 200 to 300 g were 
used to examine the localization of HGF and c-Met 
in the vocal folds by means of immunohistochem- 
istry and enzyme-linked immunosorbent assay 
(ELISA). The rats were anesthetized with an intra- 
peritoneal injection of ketamine hydrochloride (90 
mg/kg) and xylazine hydrochloride (9 mg/kg), and 
then euthanized with an intracardiac injection of 7.2% 
chloral hydrate. The larynges were immediately re- 
moved and frozen in tissue freezing medium (Trian- 
gle Biomedical Sciences Inc, Durham, North Caro- 
lina) at —70°C. 


Immunohistochemistry. Five rats’ larynges were 
used for immunohistochemistry. Four- to 8-um se- 
rial sections were made in the axial or coronal plane 
with a cryostat. The tissue sections were mounted 
on microscope slides, incubated for 30 minutes at 
room temperature, and then fixed with cold acetone 
for 10 minutes. Immunohistochemistry for HGF and 
c-Met was performed by standard methods involv- 
ing biotinylated secondary antibodies and streptavi- 
din reagent (LSAB Kit, DAKO, Carpinteria, Cali- 
fornia). Affinity-purified goat anti-rat HGFo. anti- 
body (Santa Cruz Biotechnology Inc, Santa Cruz, 


California) and affinity-purified goat anti-rat c-Met 
antibody (Research Diagnostics Inc, Flanders, New 
Jersey) were prepared. Endogenous peroxidase ac- 
tivity was blocked by incubation for 5 minutes with 
3% hydrogen peroxide. The tissue sections were in- 
cubated for 10 minutes at room temperature with pro- 
tein block reagent (DAKO), and then with anti-HGFo 
antibody (1:300) or anti-c-Met antibody (1:20) over- 
night at room temperature. Bound antibodies were 
detected with biotinylated donkey anti-goat IgG sec- 
ondary antibodies (The Binding Site Ltd, Birming- 
ham, England) and streptavidin reagent, with amino- 
ethylcarbazole chromogen used as the substrate. The 
sections were counterstained with Mayer’s hematox- 
ylin. As a negative control, the primary antibody was 
replaced by a nonimmune serum. 


Enzyme-Linked Immunosorbent Assay. Two rat vo- 
cal folds were homogenized in an ultrasonic proces- 
sor in 10 mmol/L Tris buffer (pH 8.9), and then cen- 
trifuged at 15,000 rpm for 15 minutes. The superna- 
tants were used as samples. 


The ELISA for rat HGF is a sandwich method con- 
sisting of 3 steps. Microtiter plates with 96 wells were 
coated with affinity-purified rabbit anti HGFo anti- 
body (Santa Cruz Biotechnology Inc) at a concentra- 
tion of 10 g/mL in 10 mmol/L Tris buffer (pH 8.9), 
and incubated overnight at 4°C. The plates were then 
filled with blocking buffer consisting of 2% bovine 
serum albumin (BSA) in sodium phosphate buffer 
(PBS; pH 7.1), and incubated for 1 hour. The plates 
were washed 3 times with PBS containing 0.05% 
Tween 20, and then twice with PBS. Twenty-five— 
microliter aliquots of standard HGF (Sigma, St Louis, 
Missouri) or samples were dispensed into the wells, 
and the plates were sealed and incubated for 2 hours 
at room temperature. Each sample of 2 rat vocal folds 
was examined in triplicate. After washing the plates 
with same buffer mentioned above, 25-uL aliquots 
of goat anti HGFo polyclonal antibody (Santa Cruz 
Biotechnology Inc) diluted to 1:100 with 0.2% BSA 
in PBS were added to the wells. The plates were in- 
cubated for | hour and washed as described above. 
After the addition of 25-uL aliquots of donkey (anti 
goat IgG) IgG-peroxidase diluted to 1:5,000 with 
0.2% BSA in PBS, the plates were incubated for | 
hour and then washed as described above. An aliquot 
(40 uL) of 5.5 mg o-phenylenediamine in 10 mL of 
25 mmol/L citrate and 50 mmol/L phosphate (pH 6.0) 
was added, and the plates were allowed to stand for 
10 minutes. The reaction was stopped with 40 uL of 
1 mol/L sulfuric acid. Absorbance was read at 490 
nm by a microplate reader (ELX800, Bio-Tek Instru- 
ment Inc, Winooski, Vermont). 


Rabbit Immunohistochemistry and Image Analy- 


Hirano et al, Hepatocyte Growth Factor in Vocal Folds 663 


sis Study. Eleven New Zealand breeder rabbits were 
used to examine the activities of HGF during the 
course of wound healing. The animals were sedated 
with intramuscular injections of xylazine hydrochlo- 
ride (35 mg/kg), ketamine hydrochloride (5 mg/kg), 
and acepromazine maleate (0.75 mg/kg). Microscop- 
ic direct laryngoscopy was performed with a Pillings 
pediatric endoscope and an operating microscope 
(Zeiss, Thornwood, New Jersey). Once visualized, 
the epithelium with a part of lamina propria was re- 
moved from the mid-membranous portion of the right 
vocal fold with 3-mm microforceps (Microfrance, 
Montreal, Canada). The left vocal fold was main- 
tained intact as a control. The rabbits were sacrificed 
at various days after the surgery: 1 rabbit at 1 day 
after the surgery, 2 rabbits at 3 days, 3 at 5 days, 2 at 
10 days, and 3 at 15 days. The animals were sedated 
as mentioned above and euthanized with an intrave- 
nous injection of Beuthanasol (0.22 mL/kg). Then, 
the larynges were removed for immunohistochemis- 
try and image analyses. 


Immunohistochemistry and Image Analyses in 
Rabbit Study. Five-micron-thick sections of the vo- 
cal folds were prepared in the coronal plane with a 
microtome. Immunohistochemical staining for HGF 
was performed by the same procedure used in the rat 
study, with affinity-purified goat anti HGF antibody 
(Santa Cruz Biotechnology Inc). 


Slides of tissues were analyzed with an image anal- 
ysis system specifically configured for histologic 
studies. Images were captured with a Nikon Eclipse 
E600 microscope (Nikon, Melville, New York) and 
a Pixera color camera (Pixera, Los Gatos, Califor- 
nia). The intensity (amount of stain per unit area) of 
the staining in the lamina propria with or without epi- 
thelium was measured by Metamorph Image Analy- 
sis Software (Universal Imaging, West Chester, Penn- 
sylvania). The analyses were performed on both the 
injured and normal vocal folds at x20 microscopic 
power. To provide an internal control for the back- 
ground staining intensity, we divided the average 
measurements for the staining intensity for the oper- 
ated side (O) by the same measurements for the un- 
operated side (U). The final intensity index was the 
ratio O/U. 


The regeneration index was determined by scor- 
ing the degree of reepithelialization: 0 for no reepithe- 
lialization, 1 for partial reepithelialization, and 2 for 
complete reepithelialization. The scores were aver- 
aged on each day after the injury. 


RESULTS 


Rat HGF and c-Met Localization Study. Figure | 
shows coronal plane sections of rat vocal fold epi- 





Fig 1. Immunohistologic detection of hepatocyte growth 
factor (HGF) and its receptor, c-Met, in rat vocal fold in 
coronal plane. Original magnifications given on Figures. 
Arrows — stained epithelial cells. A) Negative control. 
B) Highly magnified view of epithelium in negative con- 
trol. C,D) Staining for HGF. Epithelial cells were stained. 
E,F) Staining for c-Met. Prominent staining was observed 
on epithelial cells. There was no apparent immunoreac- 
tivity for either HGF or c-Met in lamina propria and mus- 
cle layer. 


thelium, lamina propria, and muscle layer. Figure 
1A,B shows negative controls illustrating the histol- 
ogy of normal vocal folds. Figure 1B focuses on the 
epithelium at a higher magnification (original x40). 
Figure 1C,D shows sections immunohistochemically 
stained for HGF. Cells that reacted to the antibody 
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Fig 2. Immunohistologic detection of HGF on gland cells 
(arrows) in rat vocal fold presented in axial plane (origi- 
nal x10). 


were stained red. Compared with the negative con- 
trol, HGF was prominent in epithelial cells in this 
section (Fig 1D, original x40). In contrast, there was 
no obvious staining in either the lamina propria or 
the muscle layer. Figure 1E,F illustrates immunohis- 
tochemical staining of the rat vocal fold for the HGF 
receptor c-Met. Similar to the staining for HGF, the 
epithelium was prominently stained, while no appar- 
ent staining was observed in either the lamina pro- 
pria or the muscle. Gland cells were also stained im- 
munohistochemically for HGF and c-Met. As shown 
in Fig 2, the staining was again prominent in the gland 
cells (original x10). The samples showed a mean 
(+SD) concentration of HGF of 295 + 72 ng/mL on 
ELISA. 


To summarize the immunohistochemistry of the 
rat vocal folds, staining for c-Met and HGF was clear- 
ly observed on the epithelial cells and gland cells. 
There was no prominent staining in other regions 
such as lamina propria and muscle. 


Rabbit Immunohistochemistry and Image Analy- 
sis Study. Figure 3 shows immunohistochemical 
staining for HGF on day 3 (Fig 3A) and day 10 (Fig 
3B) after surgery. As shown in Fig 3A, the lamina 
propria showed little staining on day 3, and there 
was no indication that the epithelium was regenerat- 
ing. On day 10, the injured site was partly reepithe- 
lialized, and prominent staining appeared on the re- 
generating epithelium, the fibroblasts in the lamina 
propria, and the gland cells (Fig 3B). 

Figure 4 provides a graphic representation of the 


final HGF stain intensity index and the regeneration 
index from day | through day 15 after surgery. The 
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Fig 3. Immunohistologic detection of HGF in rabbit vo- 
cal fold presented in coronal plane (original x20). A) Day 
3 after surgery. There was no staining on lamina propria, 
and epithelium was not regenerated. B) Day 10. Epithe- 
lium was restored, and prominent staining was observed 
on epithelium (EP), lamina propria (LP), and gland cells 
(GL). Arrows — fibroblasts. 


HGF activity was barely evident on any of the sam- 
ples until day 5, reached a peak by day 10, and con- 
tinued to be prominent on day 15. With time, the vari- 
ability between samples increased, as seen on both 
day 10 and day 15. The regeneration index increased 
in accordance with the increase of HGF activity. 


DISCUSSION 


Vocal fold fibrotic scar is characterized by fibro- 
sis of the lamina propria and epithelium. When the 
vocal folds are injured, fibroblasts produce various 
ECM components to repair the wound. Fibroblasts 
also secrete growth factors that may control ECM 
production; however, little is known about how 
growth factors act on the vocal fold. Transforming 
growth factor B1 (TGF £1) is well recognized as play- 
ing a major etiologic role in chronic fibrotic disor- 
ders, including subglottic stenosis.*° It enhances 
ECM deposition, and it induces growth inhibition in 
epithelial and endothelial cells. In contrast, HGF 
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Stain Intensity 
of HGF 





‘Day1 Day3 Day5S Dayi0 Day15 


Fig 4. Staining intensity (solid line) of HGF and regen- 
eration index (R.I.; dashed line). Typical activity of HGF 
was observed on days 10 to 15. Regeneration index in- 
creased in accordance with activity of HGF. Circular dots 
— staining intensity of individual animals; diamond dots 
— averaged staining intensity at each postoperative date; 
O/U — staining intensity index as ratio of staining inten- 
sity of operated side (O) to that of unoperated side (U). 


potentially suppresses expression of TGF B1 and 
stimulates the protease activity responsible for deg- 
radation of ECM components.” 


We focused on HGF because of its antifibrotic ac- 
tivity during wound healing. An increase of HGF in 
wound tissues has been reported for the liver,!° kid- 
ney,!! lung,!? stomach,!3 respiratory epithelium,!4 
and lacrimal gland.!5 Similarly, culture studies have 
shown that HGF stimulates the growth of epithelial 
cells in explants of gallbladder,!© mammary gland,!” 
and middle ear mucosa.!8 It has also been shown to 
prevent the onset or progression of liver cirrhosis and 
lung fibrosis.!9.20 Knowledge of HGF’s role has led 
to new therapeutic strategies in the treatment of liver, 
renal, and lung fibrosis by means of HGF injection 
or HGF gene therapy.*:2!.22 It is reasonable to assume 
that similar strategies might be developed for treat- 
ment of vocal fold scarring, if the prevailing condi- 
tions are similar in the vocal folds to those seen else- 
where in the body. 


c-Met is a 145 and 50 kd heterodimer expressed 
predominantly on the surface of some epithelial cells, 
and the biological activities of HGF appear only when 
` HGF couples with c-Met on the target cell membrane. 
Before the current study, it had not been determined 
whether c-Met exists on the vocal fold epithelial cells. 
Using immunohistochemistry on rat vocal folds, we 
detected localization of c-Met on both epithelial and 
gland cells. The findings implied that HGF could tar- 
get the vocal folds. We also identified HGF on epi- 
thelial and gland celis. Since the vocal folds used 
were in their normal state, it may be that HGF activ- 
ities were at their minimum. Lu et al?’ reported that 
they found faint immunohistochemical staining for 
HGF in fibroblasts and epithelial cells of normal skin. 
Li et al*4 examined the localization of HGF and c- 


Met on lacrimal glands, and found HGF protein in 
interalveolar and interlobular connective tissue cells 
adjacent to glandular alveolar cells, and c-Met pro- 
tein in the glandular alveolar and interlobular ductal 
cells in the lacrimal gland. These findings are con- 
sistent with the present results. The positive results 
of ELISA in the rat study further support the exis- 
tence of HGF in the vocal fold. 


In our companion rabbit study, the activity of HGF 
was examined during acute wound healing of the 
vocal fold. Immunohistochemical staining for HGF 
showed almost no stained area in the injured vocal 
fold on days 1 to 5. The activity of HGF appeared on 
day 10, and was lowered slightly by day 15. With 
time, there also was increased variability between 
specimens. Lu et al?3 made an artificial anus in the 
rat, and examined HGF messenger RNA (mRNA) 
during wound healing of the artificial anus using re- 
verse transcription—polymerase chain reaction. They 
demonstrated a gradual increase of HGF mRNA from 
day 1 to day 14. In contrast, Wilson et al!5 reported 
peak expression of HGF mRNA at 3 days after 
wounding of corneal epithelium. It is suggested that 
the pattern of activation of HGF after a wound oc- 
curs may vary according to the site and degree of the 
wound. Since the wounds in this study were as close 
as possible to the same site and GRETE, our study did 
not address this notion. 


Reepithelialization appeared simultaneously with 
the increase in HGF activity. Further examination, 
such as immunostaining for proliferating cell nuclear 
antigen, may be necessary to reveal more detail of 
the regenerative findings. However, the current study 
suggests, at least, that there might be a relationship 
between HGF activity and reepithelialization of 
wounded vocal folds. l 


In general, the mechanism of HGF delivery has 
been considered to be the paracrine system.? In the 
vocal folds, it is assumed that fibroblasts in the lamina 
propria produce HGF, and HGF is delivered to epithe- 
lial cells that have c-Met. Once HGF is coupled with 
c-Met, it may modulate the stromal epithelial inter- 
actions associated with the scarring response. Thus, 
the staining on fibroblasts in Fig 3 is interpreted to 
represent the production of HGF, while the staining 
on the epithelium might imply that HGF was simply 
localized in the epithelium after being bound to c- 
Met. It is still unknown whether HGF could be pro- 
duced on the epithelial cells in an autocrine manner, 
and whether it could directly affect fibroblasts and 
vary the production of ECM components from fibro- 
blasts. Some investigators have suggested the auto- 
crine system as well as the paracrine one in the bio- 
logical behavior of HGF. Very recently, it was re- 
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ported that HGF induces mesothelial cell migration 
and proliferation by autocrine and paracrine path- 
ways.? Further study will be necessary to clarify these 
issues by in situ hybridization. The present study did 
not address these aspects or clarify the mechanism 
of HGF delivery. However, we wanted to study the 
possibility of using HGF in the treatment of scarred 
vocal folds, and our goal was to confirm the exis- 
tence of c-Met in the vocal folds and the actual ac- 
tivity of HGF on injured vocal folds. 


In conclusion, we found the existence and local- 





ization of c-Met in the rat vocal fold, and confirmed 
the activity of HGF after wounding the vocal fold 
using a rabbit model. Although both rat and rabbit 
models have been used extensively in laryngeal re- 
search, it is not clear how parallel the healing prop- 
erties in these animal models are to those of humans. 
The results reported here are consistent with reports 
of HGF elsewhere in the body. They suggest that fur- 
ther examination of HGF might help clarify how it 
works on wound healing and scarring of the vocal 
folds, and in turn might lead to the development of 
unique strategies for treating scarred vocal folds. 
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TYPE I THYROPLASTY FOR ACUTE UNILATERAL VOCAL FOLD 
PARALYSIS FOLLOWING INTRATHORACIC SURGERY 
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Patients who undergo intrathoracic operative procedures for malignancy may require sacrifice of a recurrent laryngeal nerve. 
Postoperative vocal fold paralysis may lead to diminished cough with secretion retention, aspiration, and life-endangering pneumo- 
nia. This study retrospectively reviews our institution’s experience of 23 patients who underwent type I thyroplasty within the 2- 
week (acute) period after thoracic surgery. Primary lung cancer (n = 16) was the most common disease. Upper lobectomy (n = 9) and 
pneumonectomy (n = 7) were the most frequent surgical procedures. Silicone medialization alọne (n = 11) or with arytenoid adduc- 
tion (n = 12) was performed. There were no significant postoperative complications. Improvements in hoarseness (86%), dyspnea 
(72%), dysphagia (50%), and aspiration (79%) were noted. Pulmonary status improved after vocal fold medialization, as reflected by 
decreased need for therapeutic bronchoscopy in the majority of patients in the postoperative period. Type I thyroplasty for vocal fold 
paralysis in the acute phase following thoracic surgery is well tolerated and is associated with improved patient outcome with no 
postoperative deaths in this high-risk patient population. 


KEY WORDS — acute early type I thyroplasty, arytenoid adduction, intrathoracic surgery, laryngeal framework surgery, unilat- 


eral vocal fold paralysis, vocal fold medialization. 


INTRODUCTION 


The incidence of unilateral vocal fold paralysis 
(UVFP) is not high in the general population. How- 
ever, of the causes seen, more than a third have an 
intrathoracic etiology.! During the process of em- 
bryogenesis, the motor innervation of the larynx from 
the vagal trunk accompanies the descent of the fourth 
branchial arch great vessels from the neck into the 
chest. As a result, the right recurrent laryngeal nerve 
arches around the right subclavian artery, while the 
left recurrent nerve travels further caudally around 
the arch of the aorta. Because of their location, these 
structures are susceptible to direct involvement by 
mediastinal and apical intrathoracic malignancies, 
and are also prone to injury during upper thoracic 
surgical procedures. 


Unilateral transection of the recurrent laryngeal 
nerve supply to the larynx results in paralysis of a 
vocal fold, usually in the paramedian or abducted 
position (Fig 1A). In a young, otherwise healthy pa- 
tient, the contralateral vocal fold may be able to com- 
pensate by crossing the midline to meet the para- 
lyzed fold. If adequate compensation is not possible, 
laryngeal dysfunction results, manifested by altered 
phonation and deglutition, aspiration, and diminished 
force of cough. 


Patients who undergo intrathoracic surgical opera- 
tions for the extirpation of lung cancer commonly 
suffer from coexisting pulmonary dysfunction, and 
are less able to tolerate UVFP. These patients are usu- 
ally of advanced age, and tobacco use may have pre- 
disposed them to cancer and other co-morbid condi- 
tions such as chronic obstructive pulmonary disease 
and cardiac disease. Further, many have compro- 
mised immune, nutritional, and metabolic status, as 
a result both of their disease and of aggressive medi- 
cal therapy (chemotherapy, irradiation). It has been 
reported that older patients and those with cancer- 
associated weight loss are significantly less able to 
compensate for UVFP.? In these debilitated patients, 
who are then subjected to major intrathoracic sur- 
gery, glottic insufficiency caused by UVFP can be 
extremely detrimental to postoperative recovery. 


Unilateral vocal fold paralysis has multiple signifi- 
cant effects on the affected patient. First, after reso- 
lution of laryngeal edema from intubation, patients 
with UVFP who have inadequate compensation from 
the normally functioning vocal fold have a weak, 
breathy voice and are notably short of breath during 
speech. In patients with severe symptoms, the abil- 
ity to sustain a continuous rate of phonatory airflow 
is decreased by one third.? Second, impaired venti- 
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Fig 1. Axial schematics of larynx. A) Left vocal fold paralysis with abducted left vocal fold position. B) Medialization of 
paralyzed vocal fold by type I thyroplasty with silicone elastomer implant. C) Closure of posterior glottic gap with arytenoid 
adduction stitch. 


lation and dyspnea result from an inability to modu- 
late the expiratory phase of respiration, leading to a 
decrease in the normal positive end expiratory pres- 
sure. Third, without adequate glottic closure, pa- 
tients are not able to generate sufficient subglottic 
pressure to produce an effective cough to clear pul- 
monary secretions. Fourth, without the protective 
mechanism of glottic closure necessary for normal 
swallowing, patients often experience dysphagia and 
are prone to bouts of aspiration and possibly life- 
threatening pneumonia. Overall, these patients may 
have difficulty with deep respiration and pulmonary 
clearance as a result of the significant discomfort as- 
sociated with surgery on the thoracic cage and reduc- 
tion of pulmonary reserve, both because of transient 
pulmonary dysfunction secondary to atelectasis and 
because of permanent loss of lung parenchyma due 
to surgical removal. The effects of an ineffective 
cough and altered respiratory dynamics are additive 
and can be crippling. 


Type I thyroplasty with and without arytenoid ad- 
duction, initially described by Isshiki et al.4+> modi- 
fied by Wanamaker et al,° Netterville et al,’ and as 
adopted by the current authors,*-? is an effective 
means of medializing the paralyzed vocal fold, and 
has provided excellent symptom relief without sig- 
nificant morbidity.!° We reviewed our experience 
with 23 patients who underwent type I thyroplasty 
within 2 weeks after intrathoracic surgery to deter- 
mine whether vocal fold medialization is beneficial 
in the acute setting, given the debilitated nature of 
these patients. 


SUBJECTS AND METHODS 


All patients who underwent unilateral vocal fold 


medialization surgery between July 1, 1991, and Au- 
gust 30, 1999, at Memorial Sloan-Kettering Cancer 
Center were retrospectively entered into a database. 
Patients who underwent type I thyroplasty with or 
without arytenoid adduction in the “acute” setting 
(within 14 days) after intrathoracic surgery with sac- 
rifice of the recurrent laryngeal nerve were chosen 
for this study. In this cohort of patients, type I thy- 
roplasty was used as an immediate permanent method 
of vocal fold medialization, because the nerve sup- 
ply to the vocal fold was known to have been in- 
volved by the intrathoracic malignant process and/ 
or required sacrifice in order to provide an adequate 
margin during intrathoracic oncological resection. 
Concurrent arytenoid adduction was performed at the 
time of medialization thyroplasty, especially later in 
the authors’ series, if preoperative examination of the 
larynx revealed a substantial glottic gap or a notice- 
able decrease in the height of the paralyzed vocal fold. 


Patient characteristics, cause of vocal fold paraly- 
sis, type of intrathoracic surgery, presenting symp- 
toms, timing and type of vocal fold medialization 
surgery, postoperative results, and complications 
were noted. Subjective and objective parameters were 
studied. The subjective measures were based on pa- 
tient and physician assessment of speech, swallow- 
ing, and respiratory status. The objective measures 
included factual history and findings on physical ex- 
amination (including flexible fiberoptic laryngos- 
copy), evidence of pneumonia on chest radiographs, 
and documented therapeutic bronchoscopy. 


The techniques used by the authors for type I thy- 
roplasty and arytenoid adduction have been reported 
previously®? and will only be summarized here. The 
procedure is relatively brief (45 to 90 minutes) and 
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Fig 2. Timing of vocal fold medialization procedure in 
days after thoracic surgery. 


is performed under local anesthesia with appropri- 
ate sedation so that the patient can phonate during 
the operation and the degree of vocal fold medializa- 
tion required can be judged. The thyroid cartilage on 
the side of the vocal fold paralysis is exposed. If ary- 
tenoid adduction is necessary, the posterior paraglot- 
tic space is dissected, with care taken not to perfo- 
rate the pyriform sinus. A permanent stitch is used to 
tag the insertion of the posterior cricothyroid muscle 
on the muscular process of the arytenoid. A fine cut- 
ting otologic bur is used to create a window in the 
thyroid cartilage, with care taken to preserve the in- 
ner perichondrium. A pocket is created to accommo- 
date the implant, and the arytenoid adduction stitch 
is then passed, under direct vision, through the thy- 
rotomy window and out through 2 bur holes drilled 
in the anterior part of the thyroid lamina. An implant 
of appropriate size is carved in lock-and-key con- 
figuration from a silicone block and snapped into po- 
sition (Fig 1B). The arytenoid adduction stitch is then 
gradually tightened (Fig 1C). The implant size and 
arytenoid adduction stitch tension are adjusted ac- 
cording to the quality of the patient’s voice. The au- 
thors do not use a flexible fiberoptic laryngoscope 
during the procedure. 


RESULTS 


There were 237 patients who underwent unilater- 
al vocal fold medialization surgery at our institution 
during the 98-month study period. Of these, 23 pa- 
tients (9.7%) had type I thyroplasty within 2 weeks 
after intrathoracic surgery for acute UVFP resulting 
from surgical sacrifice of the recurrent laryngeal 
nerve. There were 17 men and 6 women in this group, 
with a mean patient age of 64.2 years. 


Primary lung cancer was the most common dis- 
ease (70%), followed by esophageal cancer (17%) 
and other neoplasms (mediastinal disease 9%; thy- 
moma 4%). Upper lobectomy (39%) and pneumonec- 
tomy (30%) were the most frequent surgical proce- 
dures for extirpation of cancer, followed by esopha- 


gectomy (17%) and thoracotomy (13%). Left vocal 
fold paralysis (78%) was more common than right 
vocal fold paralysis (22%). Hoarseness was by far 
the most common presenting symptom (96%), fol- 
lowed by aspiration (61%), nonexertional dyspnea 
(48%), dysphagia (26%), and radiographically docu- 
mented pneumonia (13%). 


Fifty-two percent of patients had arytenoid adduc- 
tion at the time of type I thyroplasty. Arytenoid ad- 
duction was more common later in our surgical ex- 
perience. Acute vocal fold medialization for UVFP 
was performed a median of 5.6 days after intratho- 
racic surgery (range, 3 to 12 days; Fig 2). 


The patients reported improvement in hoarseness 
(86%), aspiration (79%), dyspnea (72%), and dyspha- 
gia (50%) immediately after medialization surgery, 
and the improvement has remained stable to date. 
Overall, 96% of patients had improvement of at least 
1 symptom. Two patients (9%) required postopera- 
tive therapeutic bronchoscopy during their hospital- 
ization; both of these patients had undergone esopha- 
gectomy. In both cases, flexible fiberoptic laryngos- 
copy revealed the paralyzed vocal fold to be well 
medialized after type I thyroplasty. One of these pa- 
tients had improvement in swallowing with dimin- 
ished aspiration after esophageal dilation, but the 
other patient eventually required percutaneous en- 
doscopic jejunostomy placement and tube feeding. 
None of the 23 patients had radiographic evidence 
of pneumonia after acute vocal fold medialization. 


Two patients (9%) who had arytenoid adduction 
with type I thyroplasty experienced transient vocal 
fold edema that necessitated close airway observa- 
tion but improved with steroids and several additional 
days of hospital observation. There were no signifi- 
cant complications after acute vocal fold medializa- 
tion surgery. 


DISCUSSION 


The patient demographics of advanced age, male 
predominance, and impaired pulmonary function of 
the patients in our study are typical of patients who 
undergo intrathoracic oncological surgery. As many 
as 73% of patients with recurrent laryngeal nerve in- 
jury during intrathoracic surgery suffer from aspira- 
tion, predisposing them to life-threatening pneumo- 
nia.!! In this regard, the remarkable improvement in 
subjective patient symptoms following type I thyro- 
plasty in the acute setting seen in our study is very 
gratifying, especially given the impaired nature of 
this patient population in need of optimization of pul- 
monary function to facilitate recovery from thoracic 
surgery. By reestablishing glottic competence, these 
patients were able to generate the subglottic pres- 
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sure necessary for a strong cough to clear pulmonary 
secretions, and were able to protect their airway from 
aspiration during swallowing. The only 2 patients who 
required postoperative bronchoscopy had undergone 
esophageal resection, and esophageal anastomotic 
stricture most likely contributed to persistent aspira- 
tion, despite adequate medialization of the paralyzed 
cord. Further, most of the patients’ symptoms im- 
proved, and none of the patients developed pneumo- 
nia during their hospitalization following acute type 
I thyroplasty. 

Although the intrathoracic operations in these pa- 
tients in our series with UVFP were undertaken with 
curative intent, review of the literature indicates that 
many of these patients will eventually succumb to 
their underlying disease. Gardner et al!? reported 20% 
one-year survival after vocal fold medialization in 
patients with lung cancer. Hirano et al!? documented 
63% one-year, 32% two-year, and 22% five-year sur- 
vival after esophageal cancer operations with vocal 
fold paralysis. In a study of patients at our tertiary 
care cancer center, we determined 53% one-year sur- 
vival following type I thyroplasty and arytenoid ad- 
duction for UVFP regardless of cause.!° However, 
because all of the patients in our study underwent 
aggressive surgical management with curative intent, 
it was not possible to determine in advance which 
patients would not survive their disease. Even in pa- 
tients who have only a limited life span, it is our opin- 
ion that the improved quality of life gained by hav- 
ing better speech, ability to clear secretions, and abil- 
ity to breathe and swallow without aspirating is of 
significant benefit. 

All of the patients in this study had documented 
sacrifice of the recurrent laryngeal nerve and were 
therefore ideal candidates for immediate vocal fold 
medialization by type I thyroplasty and arytenoid ad- 
duction. Injection of Teflon into the paralyzed vocal 
fold in order to achieve immediate, permanent medi- 
alization is no longer advocated because of the high 
incidence of delayed Teflon granuloma formation. 
Neuromuscular transfer techniques may ultimately 
allow glottic competence, but do not provide the im- 
mediate vocal fold medialization necessary to pro- 
mote immediate recovery after intrathoracic surgery 
in this group of patients. For patients in whom the 
nerve supply to the vocal folds is uncertain or is 
thought to have been left intact after intrathoracic 
surgery, temporary medialization can be achieved by 
direct injection of the paralyzed vocal fold with Gel- 
foam, collagen, acellular dermis (Cimetra), autolo- 
gous fat, or other resorbable substances. Laryngeal 
electromyography may be useful in determining the 
status of motor innervation in indeterminate cases 
of UVFP. 


The timing of vocal fold medialization surgery 
following the intrathoracic operation is critical. Ideal- 
ly, medialization surgery should be undertaken as soon 
as possible after injury to the motor nerve supply of 
the vocal fold, in order to maximize the benefits of 
the procedure. Many of our patients undergo vocal 
fold surgery with chest tubes still in place. However, 
since these patients are intubated with large-diameter, 
double-lumen endotracheal tubes at the time of their 
thoracic surgery, postoperative laryngeal edema can 
camouflage the appearance and symptoms of UVFP 
for 48 to 72 hours after extubation. Further, because 
laryngeal edema precludes adequate assessment of 
the degree of contralateral vocal fold compensation, 
and would also interfere with determining the de- 
gree of medialization necessary during the thyro- 
plasty procedure, it is our practice to evaluate these 
patients on a daily basis after extubation. Typically 
at 2 to 4 days after intrathoracic surgery, patients be- 
gin to manifest symptoms as laryngeal edema re- 
solves, and an accurate assessment of glottic incom- 
petence can be made to determine whether vocal fold 
medialization surgery is necessary. 

Immediate medialization of the paralyzed vocal 
fold has been performed at the time of intrathoracic 
surgery by Mom et al,!4 and after skull base and head 
and neck surgery by Netterville et al,’ requiring sac- 
rifice of the vagus or recurrent laryngeal nerve. Both 
groups advocate use of preoperative computed to- 
mography scans of the larynx to estimate the size of 
the implant that would be necessary for appropriate 
vocal fold medialization. However, | of 14 patients 
in the series of Mom et al required revision secondary 
to overmedialization. In the series of Netterville et 
al, of the 32 patients who underwent immediate vo- 
cal fold medialization, | patient required tracheotomy 
for airway compromise and 5 others required revi- 
sion. By contrast, none of the patients in our series 
have suffered significant respiratory distress or re- 
quired revision. Immediate medialization may also 
subject a certain percentage of patients with UVFP 
who have adequate compensation from the contra- 
lateral nonparalyzed vocal fold to an unnecessary pro- 
cedure. 

Bielamowicz et al!5 reported performing arytenoid 
adduction alone under general anesthesia, usually on 
postoperative day 2 with vagal injury and UVFP fol- 
lowing skull base surgery. After medialization, 4 of 
26 patients had a persistent anterior glottic gap, 3 of 
whom required secondary type I thyroplasty. It is our 
opinion that type I thyroplasty with arytenoid adduc- 
tion, performed in an awake patient who is able to 
vocalize in order to determine the degree of vocal 
fold medialization required, is the most effective 
method of addressing UVFP in the acute setting. 
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In conclusion, type I thyroplasty with or without 
arytenoid adduction for acute UVFP following intra- 
thoracic surgery is well tolerated in this high-risk pa- 





tient population and results in improved patient symp- 


toms, a decreased risk of pulmonary complications, 
and minimal morbidity. 
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DYSPHAGIA AND ASPIRATION WITH UNILATERAL VOCAL CORD 
IMMOBILITY: INCIDENCE, CHARACTERIZATION, AND RESPONSE TO 
SURGICAL TREATMENT 


NEIL BHATTACHARYYA, MD 
TAMAR KOTZ, MS JO SHAPIRO, MD 


BOSTON, MASSACHUSETTS 


To objectively determine the incidence of dysphagia associated with unilateral vocal cord immobility (UVCD) and to evaluate the 
potential for response to vocal cord medialization, we made videofluoroscopic swallowing (VFS) recordings of patients with newly 
diagnosed UVCI and prospectively analyzed them in a blinded fashion using the Penetration-Aspiration Scale (PAS) and pharyngeal 
transport function measures. A subset of patients underwent vocal cord medialization and were evaluated with a postoperative VFS 
study. Comparison was made between preoperative and postoperative VFS results to study the effects of vocal cord medialization in 
this setting. Eighty-seven VFS recordings were studied in 64 adult patients with UVCI; 23 patients underwent VFS testing before and 
after vocal cord medialization. The UVCI was most commonly left-sided (53 cases) and most commonly resulted from thoracic or 
cardiac surgery (53.1%), followed by malignancy (15.6%). Overall, the median PAS score was 2.0 (25th-75th percentiles, 1.0-5.0), 
with 20 patients (31.3%) and 15 patients (23.4%) exhibiting penetration or aspiration, respectively. No significant differences in 
swallowing function were noted between surgical and nonsurgical causes of paralysis (PAS scores of 2.0 and 2.0, respectively; p = 
.901). The median PAS score improved from 4.0 to 3.0 (p = .395, Wilcoxon paired samples test) in patients studied after undergoing 
a vocal cord medialization procedure (6 laryngoplasties and 17 vocal cord injections). Laryngoplasty was not more successful than 
vocal cord injection in resolving aspiration (p = .27). Radiographically significant penetration or aspiration occurs in approximately 
one third of patients with UVCI, independent of the cause of paralysis. Vocal cord medialization may not be as effective as thought 
for eliminating aspiration in these patients. 


KEY WORDS — aspiration, dysphagia, unilateral vocal cord immobility, videofluoroscopic swallowing study, vocal cord medial- 
ization. 


INTRODUCTION view board for human studies research. Adult patients 
The human larynx plays pivotal roles in airway with a new diagnosis of UVCI were prospectively 
protection, respiration, and phonation.! Paralytic uni- enrolled in this study. In each case, the diagnosis of 
lateral vocal cord immobility (UVCI) is the most UVCI was confirmed by transnasal flexible fiberoptic 
common neurogenic disorder affecting the larynx. laryngoscopy. The term UVCI will be used here for 
While the effect of UVCI on voice and phonation has purposes of simplicity, although in some cases the 
been the subject of considerable research, the swal- actual diagnosis may in fact have been unilateral vo- 
lowing impact of UVCI has received much less atten- cal cord fixation. In many instances, the cause of the 
tion. Although UVCI has been postulated to predis- immobility was clear, ie, recurrent laryngeal nerve 
pose patients to penetration, aspiration, and other paralysis from thoracic or cardiac surgery or tumor- 
swallowing abnormalities, limited data exist regard- related paralysis. For each patient, the medical record 
ing the true incidence and nature of these abnormali- and radiologic studies were reviewed. Data were col- 
ties in the setting of UVCI. Specifically, the patho- lected for patient demographics, side of the lesion, 
physiology of the swallow in the setting of UVCI proposed cause of the UVCI, and date of first diag- 
has not been defined. This study was therefore under- nosis. 
taken to determine the incidence of swallowing dys- Each patient in this group (group 1) then under- 
function in the setting of a newly diagnosed UVCI, went a videofluoroscopic swallowing (VFS) study. 
characterize the dysphagia if present, and determine All patients underwent speech and swallowing reha- 
whether vocal fold medialization procedures improve bilitation with trained personnel throughout the course 
the swallowing dysfunction associated with UVCI. of their management. We made VHS video record- 
ings of the fluoroscopic images of the oral and pha- 
METHODS ryngeal stages of the swallowing in the lateral and 
This study was approved by our institutional re- anterior-posterior projections. For the purposes of this 
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TABLE 1. PENETRATION-ASPIRATION SCALE 
Description 








Category Score 
None 1 Contrast does not enter airway 
2 Contrast enters airway; remains above 
vocal folds; no residue 
3 Contrast remains above vocal folds; 
visible residue remains 
4 Contrast contacts vocal folds; no residue 
Contrast contacts vocal folds; visible 
residue remains 
6 Contrast passes glottis; no subglottic 
residue visible 
Contrast passes glottis; visible subglottic 
residue despite patient’s response 
8 Contrast passes glottis; visible subglottic 
residue; absent patient response 
Scale from Rosenbek et al.4 





Penetration 


Ur 


Aspiration 7 





investigation, the lateral view of the swallow was 
used, as it provides an unobstructed view of the aero- 
digestive pathway and allows for easier identifica- 
tion of aspiration than do alternative views.? The vid- 
eo recordings of each patient’s 5-mL liquid barium 
swallow study were subjected to in-depth real-time, 
slow-motion, and frame-by-frame analysis by a se- 
nior speech pathologist experienced in radiographic 
evaluation of swallowing disorders. Completed re- 
cordings were analyzed in random order at a later 
date without information (investigator blinded) re- 
garding the identity, diagnosis, and treatment of the 
study subjects. ` 


The Penetration-Aspiration Scale (PAS) developed 
by Rosenbek et al4 (Table 1) was used to score each 
VFS study. This swallowing scale scores the swal- 
low by the depth to which material passes into the 
airway and by whether or not material entering the 
airway is expelled. The intrarater and interrater reli- 
ability of this scale has been previously established.’ 
In addition, the timing of the penetration or aspira- 
tion was recorded (ie, before the swallow, during the 
swallow, or after the swallow). Penetration was de- 
fined as the passage of bolus material into the laryn- 
geal inlet without passage beyond the level of the 
true vocal folds. Aspiration was defined as the pas- 
sage of bolus material beyond the level of the true 
vocal cords. To further describe the pharyngeal stage 
of the swallow, we determined the presence or ab- 
sence of swallow delay and reduced laryngeal eleva- 
tion and documented the presence or absence of bo- 
lus residue along the base of the tongue, valleculae, 
pyriform sinuses, posterior pharyngeal wall, and pha- 
ryngoesophageal segment. Evidence of cricopharyn- 
geal dysfunction was also recorded. 


A subgroup of patients (group 2) underwent a pro- 
cedure directed at improving phonation, tussive 
strength, and airway protection. This patient popu- 


lation was recruited in the second half of the study. 
The procedure undertaken, vocal fold augmentation 
(ie, Gelfoam or Teflon injection under direct laryn- 
goscopy) or type I thyroplasty, was recorded. The 
choice of procedure was left to the discretion of the 
attending surgeon and was based on clinical factors 
including the need for pulmonary toilet, the need for 
voice rehabilitation, the patient’s suitability for anes- 
thesia, and the prognosis for the patient’s recovery. 
Each of these procedures was performed with stan- 
dard technique. The patients then underwent a post- 
operative VFS study to examine the effect of the la- 
ryngeal procedure on deglutition. A strong effort was 
made to perform the postoperative VFS study within 
1 month of the surgical procedure, but variability due 
to patient condition, geography, and scheduling is- 
sues was encountered. Patients were excluded if their 
postoperative VFS study could not be performed with- 
in 4 months of the procedure. In all cases, continued 
vocal cord immobility was confirmed with repeat fi- 
beroptic laryngoscopy prior to the postoperative VFS 
study. This VFS study was analyzed by the same 
methods as the preoperative study, with the rater 
blinded to the timing of the swallow (ie, before or 
after vocal fold medialization) and the underlying 
diagnosis. 


Demographic and VFS data were tabulated and 
entered into a database. The data were analyzed with 
the SPSS statistical package system version 10.0 (Chi- 
cago, Illinois). Descriptive statistics and frequencies 
were computed for demographic variables, causes 
of UCVI, and incidence of aspiration. The Mann- 
Whitney U test was used to determine differences in 
swallowing parameters between acute and nonacute 
causes and surgical versus nonsurgical causes of pa- 
ralysis. A Kruskal-Wallis analysis of variance 
(ANOVA) was conducted to explore differences in 
the swallow parameters among causes. 


In the subgroup of patients (group 2) who were 
studied both before and after their laryngeal proce- 
dure for UVCI, scores on the PAS study were com- 
pared with a Wilcoxon paired samples test. Intragroup 
comparisons were conducted examining the effec- 
tiveness of thyroplasty versus vocal cord injection 
in resolving aspiration. A power analysis was con- 
ducted by using the efficiency of the given nonpara- 
metric test based upon Student’s paired t-test for this 
group as well.® 


RESULTS 


We prospectively enrolled 64 adult patients (30 
women and 34 men) in this study. The mean patient 
age at the time of diagnosis was 61.5 years (SD, 12.6). 
In 53 patients (82.8%), the paralysis was left-sided; 
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TABLE 2. DISTRIBUTION OF PARALYSES BY CAUSE 











Cause No. % 
Thoracic surgery 25 39.1 
Cardiac surgery 9 14.1 
Neck surgery (nonthyroid) 2 3:1 
Thyroid surgery 3 4.7 
Stroke or intracranial cause 4 6.3 
Idiopathic 6 9.4 
Thoracic or mediastinal malignancy 10 15.6 
Aortic aneurysm l 1.6 
Other or unknown 4 6.3 
Total 64 





the remaining 11 patients (17.2%) had right-sided 
UVCI. Table 2 lists the distribution of paralyses by 
cause. Thoracic surgery accounted for the greatest 
percentage of UVCI cases (39.1%). Patients in the 
“unknown” cause group did not have a discernible 
cause for the UVCI (ie, the diagnostic evaluation of 
the UVCI cause was not complete enough to assign 
them to the idiopathic group) or were more likely to 
have unilateral vocal cord fixation based on a his- 
tory of underlying rheumatoid arthritis or autoimmune 
disease. 


The VFS study was performed on average 8.9 
weeks after the initial diagnosis of the UVCI for 
group 1 (the 64 patients with a new diagnosis of 
UVCI). The median PAS score with newly diagnosed 
UVCI was 2.0 (25th-75th percentiles, 1.0-5.0). Twen- 
ty patients (31.3%) exhibited laryngeal penetration 
without aspiration, and 15 patients (23.4%) exhib- 
ited frank aspiration on VFS study. Data for the tim- 
ing of penetration and aspiration are presented in Ta- 
ble 3. Several patients exhibited penetration without 
aspiration at one point in the swallow, and subse- 
quently aspirated later in the swallow, accounting for 
the additional 11 patients with penetration in Table 
3. Data regarding the various anatomic sites of bo- 
lus residue are presented in Table 4. Typical examples 
of VFS images obtained from our patient population 
are presented in Fig 1. Ina small number of patients, 
we were unable to assess the presence or absence of 
bolus residues because of poor radiologic imaging, 
failure to capture this portion of the swallow on vid- 
eotape, or other factors. Overall, 33 patients (51.6%) 
exhibited significant pharyngeal bolus residue in 1 


TABLE 3. TIMING OF PENETRATION OR ASPIRATION | 


Aspiration 








Penetration* 








Phase No. KA No. % 

None 33 51.6 49 76.6 
Before swallow 4 6.3 0 0.0 
During swallow 26 40.6 8 12.5 
After swallow l 1.6 J: 10.9 


*Penetration without aspiration. 





TABLE 4. LOCATION AND FREQUENCY OF BOLUS 














RESIDUE 
Unable to 
Present Absent Assess 
Anatomic Site No. % No. % No. & 
Base of tongue 8 125 52 813 4 6.3 
Vallecula 30 469 30 469 4 6.3 
Pyriform sinus 20 31.3 40 625 4 6.3 
Posterior pharyngeal 
wall 5 7.8 54 844 5 7.8 


Cervical esophagus 0 00 59 922 5 7.8 





or more subsites. Thirty-three patients (51.6%) ex- 
hibited evidence of a delay in the initiation of swal- 
lowing, and 14 patients (21.9%) were found to dem- 
onstrate reduced laryngeal elevation during the swal- 
low. Reduced pharyngoesophageal sphincter open- 
ing (cricopharyngeal dysfunction) was identified in 
5 patients (7.8%). 


No statistically significant difference was found 
on the PAS score between right-sided (median PAS 
score, 3.0) and left-sided cases of UVCI (median PAS 
score, 2.0; p= .489, Mann-Whitney U test). Similar- 
ly, patient age and gender were not statistically re- 
lated to the PAS score (p > .05). For the purposes of 
comparison, the patients were further subclassified 
into 4 groups based on broader etiologic categories. 
The median PAS data distributed by etiologic cate- 
gory are displayed in Table 5. No significant differ- 
ence in PAS score was identified between the vari- 
ous UVCI causes (p = .931, Kruskal-Wallis ANOVA). 
Patients with UVCI due to surgical causes manifested 
a median PAS score of 2.0, and those with UVCI 
from a nonsurgical cause had a median PAS score of 
2.0 (p = .901). Patients with bolus residue in 1 or 
more subsites had a statistically higher median PAS 
score (3.0) than patients who did not exhibit residue 
(median PAS score, 1.0; p = .017). No statistically 
significant difference in PAS scores was identified 
among patients with and without cricopharyngeal dys- 
function (PAS scores, 1.0 and 2.5, respectively; p = 
45). 


Twenty-three patients (group 2) were studied with 
VFS before and after a vocal cord medialization pro- 
cedure. Seventeen vocal cord injections (14 Gelfoam 
vocal cord injections and 3 Teflon injections) and 6 
medialization thyroplasties were performed. The 
mean time interval between the laryngeal procedure 
and the follow-up VFS study was 2.4 weeks (SD, 
3.5), and all VFS studies were performed within 4 
months of operation. The distribution of PAS scores 
obtained before and after the procedure is displayed 
in Table 6 and Fig 2. Overall, the median improve- 
ment in PAS score from 4.0 to 3.0 was not statisti- 
cally significant (p = .395, Wilcoxon paired samples 
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Fig 1. Examples of videofluoroscopic swallowing findings in patients with unilateral vocal cord immobil 
ity. A) Normal bolus movement without penetration or aspiration. B) Laryngeal penetration without aspi 
ration. Note that bolus stops at vocal folds (arrow). C) Frank aspiration of bolus. Arrow — vocal folds. D) 
Significant pooling of bolus in valleculae (small arrow) and pyriform sinuses (large arrow). 


test). Power calculations indicated an overall power 
of 0.95 to detect a change in the PAS scale of 2.0 
units. 

No statistically significant difference between me- 
dialization thyroplasty and injection augmentation 
was found in terms of improvement of the PAS score 
(p = .27). In group 2, 2 patients with radiographi- 
cally severe aspiration (PAS score > 7) had signifi- 
cant improvement in their postoperative PAS scores 
with no aspiration; | patient without severe aspira- 





TABLE 5. PAS SCORES BY ETIOLOGIC GROUP 








Median 25th-75th 
Group N PAS Score Percentiles 
Chest cause 45 2.0 1.0-5.0 
Neck cause 5 2.0 1.0-3.5 
Idiopathic 10 2.5 1.0-7.3 
Intracranial 4 4.0 1.0-7.8 
Overall 64 2.0 1.0-5.0 


PAS — Penetration-Aspiration Scale.* 





tion worsened after the vocal cord injection proce- 
dure, and had severe aspiration on the VFS study. 


DISCUSSION 


Although the human larynx is most commonly 
thought of as a phonatory organ, teleologically, its 
most important function is the protection of the air- 
ways during deglutition.! The protective effect of the 
larynx during swallowing is derived from mechani- 
cal closure of the glottis resulting in a physical bar- 
rier between the hypopharynx and the trachea, as well 
as epiglottic deflection, laryngeal elevation, and ap- 
position of supraglottic structures, including the ary 
epiglottic folds and the false vocal folds. 


Paralytic UVCI is the most common neurologic 
disorder of the larynx.? Clinically, UVCI should be 
distinguished from unilateral true vocal fold fixation 
and laryngeal synkinesis, as each of these diagnoses 
may have different potential causes, treatment strat- 
egies, and clinical outcomes. Although we did not 
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TABLE 6. PENETRATION-ASPIRATION SCORES BEFORE AND AFTER VOCAL FOLD MEDIALIZATION 














PROCEDURES 
Before Procedure After Procedure Median 
Median 25th-75th Median 25th-75th PAS Score 
Procedure -PAS Score Percentiles PAS Score Percentiles Improvement p* 
Medialization thyroplasty 2.5 1.8-7.3 3.0 1.8-7.3 —0.50 .690 
Vocal cord injection 4.4 1.0-7.5 3.0 1.0-7.0 1.40 500 
Overall 4.0 1.0-7.0 3.0 1.0-7.0 1.00 .395 


*Wilcoxon paired samples test. 





perform laryngeal electromyography in our series of 
patients with newly diagnosed UVCI, the vast ma- 
jority of our cases were neural paralyses rather than 
vocal fold fixation. Four pat:ents in this series were 
thought to have a probable diagnosis of vocal cord 
immobility due to fixation (included in the unknown- 
cause group) rather than neurologic paralysis (Table 
2). Although patients with UVCI have an alternate 
underlying pathophysiology with respect to the cause 
of vocal fold fixation, they often exhibit a degree of 
glottic insufficiency similar io that of patients with 
vocal cord immobility. Therefore, we have included 
these cases in our analysis. 


Clinically, patients with UVCI most commonly 
present with dysphonia, which is often the predomi- 
nant clinical complaint prompting medical evalua- 
tion.8 However, given the important role of the lar- 
ynx in deglutition, UVCI may also result in dyspha- 
gia. The dysphagia may rang2 from mild symptoms 
that do not interfere with an cral diet to more severe 
problems characterized by aspiration pneumonia. 
Furthermore, patients with n2urologic disorders or 
already poor pulmonary function may be at greater 
risk for complications arising from dysphagia with 





Post-procedure PAS Score 





0 1 2 3 4 5 6 7 8 9 


Initial PAS Score 


Fig 2. Effects of laryngeal medialization procedures on 

Penetration-Aspiration Scale (PAS) scores in 23 patients. 

Data points below 45° line imply decreased aspiration 

after procedure. Total number appears less than 23 be- 
- cause of overlap of data points. 





UVCL In addition, as UVCI often diminishes cough 
strength, thereby impeding one of the central mecha- 
nisms of airway cleansing and protection, patients 
may face an even greater risk. This combination may 
be especially serious in patients with chronic obstruc- 
tive pulmonary disease, in whom even mild aspira- 
tion with weakened cough strength may have disas- 
trous health consequences. 


Other, smaller studies have identified the potential 
impact of UVCI on airway protection and swallow- 
ing. In a small series of patients with UVCI second- 
ary to surgical trauma, Perie et al? found 2 of 5 pa- 
tients to have significant aspiration on VFS. Eventu- 
ally, all of these patients returned to a normal diet, 
despite persistent UVCI. In a larger retrospective se- 
ries, Flint et al® found that approximately 60% of 
patients with UVCI complained of some swallow- 
ing difficulty, ranging from mild to severe. Qualita- 
tively, they found that medialization thyroplasty was 
somewhat effective in allowing a return to normal 
diet in the setting of UVCI. However, objective evalu- 
ation of the dysphagia and aspiration risk associated 
with UVCI was not systematically conducted with 
VES studies. In a recent study by Heitmiller et al,10 
the prevalence of aspiration and laryngeal penetra- 
tion in patients with UVCI was retrospectively as- 
sessed. Aspiration and laryngeal penetration were 
identified in 38% and 12% of patients, respectively. 
These studies provide evidence that patients with 
UVCI are at significant risk for dysphagia and aspi- 
ration. Certainly, aspiration is the most serious po- 
tential dysphagic consequence of UVCI, and must 
be assessed in each newly diagnosed case to ensure 
that the patient can swallow safely. Part of this evalu- 
ation should include assessment of the patient’s pul- 
monary function and neurologic status with respect 
to the potential clinical impact of even mild aspira- 
tion. 


Our patient population constitutes a prospective 
sample of patients with newly diagnosed UVCI. The 
patient characteristics and the distribution of paraly- 
ses by cause were similar to those of other studies of 
UVCI.!112 We found that thoracic and cardiac sur- 
gery accounted for the vast majority of the paraly- 
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ses. Our frequencies for laryngeal penetration and 
aspiration are similar to those of Heitmiller et al.!° 
In most of our cases, laryngeal penetration was found 
to occur during the swallow, indicating decreased clo- 
sure of the laryngeal vestibule at the time of the swal- 
low. About half of the patients, however, were found 
to aspirate after the swallow. This effect is due to the 
persistence of bolus residue in the pharynx once the 
swallow is completed; the residue is then aspirated 
when the patient inspires after the swallow. The pres- 
ence of pharyngeal residue can usually be attributed 
to reduced posterior movement of the tongue base, 
reduced pharyngeal wall movement, and decreased 
laryngeal elevation. However, most of our patients 
had recurrent laryngeal nerve injury without involve- 
ment of the pharyngeal plexus; these patients had no 
readily discernible reason for, or evidence of, de- 
creased tongue base or pharyngeal wall movement. 
As we found a statistically significant association be- 
tween the presence of bolus residue and a higher PAS 
score, persistence of the barium bolus in the vallecu- 
lae and pyriform sinuses in a significant proportion 
of the patients highlights bolus residue as a risk fac- 
tor for postswallow aspiration in patients with UVCI. 
In these patients, it may be that a persistently open 
airway (resulting from incomplete glottic closure) 
may impair the necessary negative hypopharyngeal 
sump pressure needed to propel the bolus through 
the pharynx, thereby resulting in postswallow resi- 
due. Postswallow aspiration does seem, at least in 
part, to be linked to persistent bolus residue above 
the larynx. In this patient population, swallowing 
strategies directed at eliminating pharyngeal residue 
may help prevent aspiration. 


We did not find reduced pharyngoesophageal 
sphincter opening (cricopharyngeal dysfunction) to 
be a common finding in the setting of UVCI. Al- 
though others have suggested that cricopharyngeal 
myotomy may be helpful in swallowing rehabilita- 
tion for patients with UVCTI, our data do not support 
this conclusion.®:!3 Other investigators have found a 
similar limited prevalence of cricopharyngeal dys- 
function in UVCI.!° 


We were unable identify any influence of the cause 
of UVCI on the PAS score. Similarly, surgical ver- 
sus nonsurgical causes of UVCI were not found to 
result in different swallowing handicaps. Other au- 
thors have identified differences in the swallowing 
impact of various UVCI causes. '‘ It is generally felt 
that skull base and “high” vagal lesions result in more 
significant swallowing dysfunction.’ As we did not 
have a significant proportion of paralyses due to skull 
base lesions or intracranial causes, we were unable 
to confirm this correlation statistically, although in- 
tracranial causes did have a trend toward a higher 


median PAS score. Intracranial or skull base abnor- 
malities leading to UVCI result in more anatomic 
and physiologic abnormalities as compared to recur- 
rent laryngeal nerve paralysis alone. Pharyngeal bo- 
lus propulsion, pharyngolaryngeal sensation, esoph- 
ageal motility, and cognitive ability may all be ad- 
versely affected, leaving these patients with substan- 
tially poorer swallowing function. Consequently, oth- 
er investigators have advocated early medialization 
procedures for these patients.8.!3-15 


Both injection techniques and medialization thyro- 
plasty have been shown to be effective in rehabilita- 
tion of the voice in UVCI. Anatomically, both tech- 
niques share the common goal of repositioning the 
free margin of the paralyzed true vocal cord as close 
to the midline as possible.!6!7 This repositioning then 
affords more complete glottic closure and a subse- 
quent improvement in voice quality. However, little 
objective data are available regarding swallowing 
outcomes in these patients. Often, improvement in 
deglutition is a secondary concern, as the dysphonia 
is perceived to be the major clinical problem. How- 
ever, it is clear from the prevalence data presented 
above that a significant percentage of patients will 
suffer from significant laryngeal penetration or aspi- 
ration and may require intervention directed at swal- 
lowing rehabilitation. We were surprised that neither 
vocal cord injection techniques nor medialization thy- 
roplasty were able to effect a significant improve- 
ment in the PAS score in UVCI. Although many pa- 
tients started with “normal” swallows in terms of their 
PAS scores, several patients had a poorer score after 
their procedure. There may be several reasons for 
poorer scores after medialization thyroplasty. For ex- 
ample, the surgical approach to the laryngeal frame- 
work involves dissection near the strap musculature. 
Inadvertent trauma to the strap muscles, the superior 
laryngeal nerve, or the ansa cervicalis may lead to 
postoperative swallowing dysfunction due to cica- 
trix formation, diminished laryngeal elevation, or de- 
creased laryngopharyngeal sensation. The reasons for 
poorer swallowing performance after injection vo- 
cal cord augmentation are less clear, since this tech- 
nique is minimally invasive and does not disturb the 
surrounding soft tissues and innervation of the lar- 
ynx. The lack of improvement in PAS score after in- 
jection techniques implies that dysphagia and aspi- 
ration in UVC] are much more complex than a simple 
lack of closure at the glottic level. Persistent dys- 
phagia may be due to altered laryngopharyngeal sen- 
sation related to the nerve paralysis, decreased laryn- 
geal elevation, or other factors yet to be elucidated. 

Swallowing failure after medialization procedures 


may also result from technical errors in the perfor- 
mance of the procedure itself. All of our patients were 





678 Bhattacharyya et al, Unilateral Vocal Cord Immobility 


deemed to have a “satisfactory” medialization result 
based upon subjective improvement in cough strength 
and phonation time. Furthermore, none of the pa- 
tients required a near-term revision of the medializa- 
tion procedure prior to the swallowing study. How- 
ever, it is still possible that a technical error, such as 
an excessively anterior injection with a persistent pos- 
terior glottal gap, could have led to persistent or wor- 
sened aspiration. These factors must be kept in mind 
when interpreting our data, and when considering 
patients with UVCI for these medialization proce- 
dures. 


Our data suggest that persistent bolus residue in 
the pharynx is implicated in aspiration in patients 
with UVCI and that this is a topic for further study. 
However, it is important to note that patients with 
UVCI may benefit significantly from vocal cord me- 
dialization because of an improved cough enabling 
the patient to clear secretions and aspirated material. 
Furthermore, it is imperative that patients receive sys- 
tematic swallowing therapy in the management of 
their vocal cord immobility. 


Other authors have found various medialization 
techniques valuable in the management of aspiration 
in this setting. For example, Woodson!§ found that 
cricopharyngeal myotomy and arytenoid adduction 
were helpful in eliminating aspiration in 10 of 13 
patients with combined laryngeal and pharyngeal pa- 
ralysis. However, the swallowing outcomes were not 
objectively documented with VFS studies. Laccour- 
reye et al!? found that autologous fat injection elimi- 
nated aspiration in 17 of 20 patients, as documented 
by fiberoptic examination of swallowing with meth- 
ylene blue-stained pureed food. It is difficult to com- 
pare these results with our series, since we did not 
perform arytenoid adduction or cricopharyngeal my- 
otomy, and we analyzed the swallowing with a liq- 


uid rather than a pureed bolus. The differences in 
swallowing outcomes between these series and our 
present series to some degree reflect the objective 
stringency of the VFS study in the evaluation of swal- 
lowing. We chose to use the VFS study because it 
provides an objective measure of swallowing that can 
be evaluated by a blinded observer. However, al- 
though the VFS study may document penetration or 
aspiration, the patients may still be able to tolerate 
oral diets with other maneuvers, such as alterations 
in food consistency. We, too, have generally found 
that patients have clinical improvement in their over- 
all swallowing function and a reduction in the vol- 
ume of aspiration after vocal cord medialization or 
injection procedures. 


Given that we were unable to identify a statisti- 
cally significant improvement in the PAS score from 
laryngeal procedures, power calculations were per- 
formed. This analysis indicated that the sample size 
had a greater than 0.95 chance of detecting a median 
change in PAS score of 2 units. We therefore con- 
clude that although one might intuitively expect an 
improvement in swallowing function from better glot- 
tic apposition and closure, these procedures may not 
significantly improve the penetration and aspiration 
associated with UVCI when present. Furthermore, 
medialization thyroplasty may in fact lead to a slight 
worsening of swallowing in this setting. Patients 
should be counseled regarding this risk, especially if 
they are entering the procedure with a normal or near- 
normal swallow. It is possible that there are certain 
subsets of patients, for example, based on the causa- 
tion or duration of paralysis, that may derive more 
significant benefit with respect to swallowing func- 
tion from medialization procedures. Further study 
will be required to determine predictive characteris- 
tics for these patients. 
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BILATERAL VOCAL FOLD PARALYSIS AND ADHESION IN ANTERIOR 


SPINAL ARTERY SYNDROME 
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TOMOKO TSUJI, MD MASARU YAMASHITA, MD 
KOBE, JAPAN KOBE, JAPAN 


The purpose of this report is to present a rare case of anterior spinal artery syndrome (ASAS) in which there proved to be a 
combined lesion of paralysis and adhesion. A 26-year-old woman with a history of ASAS complained of difficulty of tracheal de- 
cannulation. In 1988, she was intubated and underwent tracheotomy because of respiratory muscle weakness, and she was decannulated 
in 1990. In 1998, she had cesarean delivery under general anesthesia, and postdelivery dyspnea necessitated tracheotomy again. On 
her first visit to us, endoscopic examination revealed bilateral vocal fold immobility at the midline without an apparent web. Direct 
laryngoscopy under general anesthesia revealed a posterior glottic adhesion and scarring, which were treated by excision of the scar 
and local steroid injection. The left vocal fold gradually regained mobility, permitting decannulation 3 months after treatment. This 








complicated vocal fold immobility »vas found to be due to adhesion and partial paralysis combined. 


KEY WORDS — anterior spinel artery syndrome, bilateral vocal fold paralysis, posterior glottic adhesion. 


INTRODUCTION 


Bilateral vocal fold immobility can be a life-threat- 
ening condition. Bilateral immobility of the vocal 
folds includes 4 conditions: paralysis, synkinesis, cri- 
coarytenoid joint fixation, and posterior glottic ad- 
hesion. The causes of paralysis are central nervous 
system disease, neuromuscular disease, and periph- 
eral nerve injury. Synkinesis is the result of aberrant 
reinnervation after a peripheral nerve injury. Crico- 
arytenoid joint fixation includes dislocation of the 
arytenoid cartilage, which is most frequently the re- 
sult of intubation trauma, and enkylosis of the crico- 
arytenoid joint, resulting from arthritis or connec- 
tive tissue disorder. Posterior glottic adhesion is some- 
times also noted as a sequel of intubation trauma.! 


The differential diagnosis among these is usually 
not difficult to make from the aistory and laryngeal 
findings. We encountered a patient with an 11-year 
history of anterior spinal artery syndrome (ASAS) 
who had bilateral vocal fold immobility. 


Anterior spinal artery syndrome is a rare and deva- 
stating condition caused by inadequate blood supply 
to the spinal cord from the anterior spinal artery. It 
has characteristic symptoms that consist of sudden 
onset of muscle weakness, flaccid quadriparesis with 
pain, dissociated sensory loss telow the level of the 
lesion, and bladder dysfunction. Respiratory dysfunc- 


tion can occur if the upper cervical spinal cord is 
involved.2,3 


. CASE REPORT 


A 26-year-old woman with an 11-year history of 
ASAS had difficulty with tracheotomy decannula- 
tion. In October 1988, she had sudden onset of pain 
in the neck, quadriplegia with flaccidity and pain, 
unconsciousness, and respiratory arrest. She was in- 
tubated for about 3 weeks and underwent tracheot- 
omy. She was weaned from ventilatory support in 
February 1990 and was decannulated in September 
1990. In September 1998, she had an uneventful ce- 
sarean delivery under general anesthesia, which was 
said to be possible only by forceful intubation with a 
tube of small diameter. Postdelivery dyspnea neces- 
sitated tracheal intubation for a week, and then she 
underwent tracheotomy again. ' 


On her first visit to us in March:1999, endoscopic 
examination revealed that both vocal folds were fixed 
at the midline (Fig 1); a normal voice could be pro- 
duced when the stoma was digitally closed. No web 
was noticed at that time. When closing the stoma, 
she had dyspnea and stridor upon respiration. A com- 
puted tomography scan showed a deformity of the 
lamina of the cricoid cartilage (Fig 2). Her upper and 
lower extremities were deformed by prolonged mus- 
cle weakness and resultant joint contractures. She 
was unable to move her right shoulder and elbow, 
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Fig 1. Preoperative endoscopic views. A) Ex- 
piratory phase. B) Inspiratory phase. 


but she was able to partially move both wrists and 
the left shoulder and left elbow. Paraplegia of her 
lower extremities had improved enough for her to 
walk independently. 


In May 1999, she underwent direct microscopic 
laryngoscopy under general anesthesia for operative 
evaluation of the cricoarytenoid joints and the pli- 
ability of the posterior glottic tissue. Adhesion and 
scarring around the vocal processes were found (Fig 
3), although preoperative endoscopic examination 
did not reveal the posterior glottic lesion. Arytenoid 
dislocation and cricoarytenoid joint ankylosis were 
not present on either side. During direct laryngos- 
copy, the adhesion was excised by cold instruments, 
and steroid (triamcinolone acetonide) was injected 
into the scar lesion. 

After the operation, the left vocal fold gradually 
regained some mobility, which was sufficient for both 








voicing and respiration. The right vocal fold remained 
immobile. She was decannulated 3 months after the 
surgery. There has been no recurrence of the poste- 
rior glottic adhesion and neither dyspnea nor stridor 
for the past 2 years (Fig 4). 


DISCUSSION 


The evaluation of a patient with bilateral vocal fold 
immobility should be based on a careful history and 
head and neck examination. With these alone, the 
majority of causes can be determined. The physical 
examination usually benefits from fiberoptic endos- 
copy with video recording.! The best method to eval- 
uate joint fixation, dislocation or ankylosis, and pos- 
terior glottic adhesion is suspension laryngoscopy un- 
der general anesthesia. The surgeon can palpate the 
vocal folds and arytenoid cartilages through direct 
laryngoscopy. This confirms whether the immobil- 


Fig 2. Computed tomography. A) At level of 
cricoid cartilage. B) Three-dimensional recon- 
structed image. 


682 Omori et al, Bilateral Vocal Fold Immobility 





ity derives from fixation of the cricoarytenoid joints 
or neural loss. The finding of passively mobile vocal 
folds strongly suggest that the problem is most likely 
a neuromuscular dysfunction. The pliability of the 
posterior glottic tissue can be assessed by instrumen- 
tal palpation. Soft tissue is more likely to be amen- 
able to endoscopic manipulation than is hard fibrotic 
tissue. The length of the stenotic segment is also as- 
sessed at this time.!4 


Clinical manifestations of ASAS are sudden on- 
set of flaccid quadriparesis with pain, dissociated sen- 
sory loss below the level of the lesion, and bladder 
dysfunction, which result from inadequate blood sup- 
ply to the spinal cord. The principal vascular supply 
to the spinal cord is the anterior spinal artery, which 
originates in its most rostral portion from the verte- 
bral arteries proximal to their union, forming the basi- 
lar artery at the ventral surface of the medulla oblon- 
gata. Lower cranial nerve paralysis may also occur 
as aresult of vertebrobasilar artery insufficiency. The 
causes of ASAS reportedly include arteriosclerosis, 


Fig 4. Postoperative endoscopic views (1 year 
8 months after surgery). A) During respiration. 
B) During phonation. 


Fig 3. Direct laryngoscopic views. Asterisks — 
forceps widening posterior glottis. A) Scar tis- 
sue (arrow) between vocal processes. B) Most 
of scar tissue has been excised. Tiny scar (ar- 
row) remains between inferior surfaces of vocal 
processes. C) After total excision of posterior 
glottic adhesion. D) Steroid injection into raw 
surface of scar tissue. 


infection, vasculitis, embolic events, sickle cell ane- 
mia, cervical cord herniation, surgery, and trauma. 
In our case, the cause of the syndrome was not clear. 


A diagnosis of bilateral vocal fold paralysis can 
usually be made without difficulty by fiberscopic ex- 
amination. However, it is not always easy to find a 
combination of bilateral paralysis and adhesion, un- 
less the adhesion is recognized on the upper surface 
of the vocal folds. Actually, in the present case, retro- 
spective repeated reviewings of the videotape record- 
ing of the laryngeal findings failed to reveal any sign 
of adhesion anywhere on the vocal folds. Therapeu- 
tic policy ts entirely different depending on whether 
there is bilateral vocal fold paralysis solely or a com- 
bination of paralysis and adhesion. After the patient 
discovered the web site of the Isshiki Clinic (Kyoto, 
Japan), she came to us for consultation, because the 
doctor at the institute she previously visited told her 
that the only way to solve the problem was to relieve 
the respiratory difficulty at the cost of her voice.>-7 


In our case, thorough laryngeal microexamination 
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under general anesthesia was performed first; it re- 
vealed, rather unexpectedly, the adhesion between the 
vocal processes. Neither arytenoid dislocation nor cri- 
coarytenoid joint ankylosis was found. After the op- 
eration, the left vocal fold gradually regained some 
mobility, but the right vocal fold was still immobile. 
The present case is an example of the occurrence of 
a combination of vocal fold paralysis and vocal fold 
adhesion. 


We would like to emphasize that the diagnostic 
key lies in the history. As mentioned above, the pa- 
tient had sudden-onset dyspnea as a symptom of 


-ASAS, which required intubation and tracheotomy 


in 1988. The dyspnea ameliorated gradually, and she 
was decannulated after 2 years. In-1998, delivery by 
cesarean operation was carried out under general an- 
esthesia, which was said to be possible only by force- 
ful intubation with a tracheal tube of small diameter. 
Afterward, she began to have respiratory distress 


again, necessitating tracheotomy. This history sug- 
gested that she had had a narrow glottis with unilat- 
eral or bilateral vocal fold paralysis since the onset 
of ASAS or previous intubation in 1988. It is most 
likely that the intubation through a narrow glottis 
caused local edema and a laceration wound, result- 
ing in adhesion, although the details were unknown 
because we could not get in touch with the physi- 
cians in charge at the previous hospital, which is re- 
mote from ours. 


Excision by cold instruments and local steroid in- 
jection were sufficient to solve the problem of adhe- 
sion between the vocal processes in the present case. 
The patient was fortunate in that restoration of one 
vocal fold restored the airway sufficiently for her to 
be decannulated. We advised her to avoid lengthy 
anesthesia with intubation through the narrow glot- 
tis, if possible, as it would precipitate recurrence of 
the adhesion. 
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Fatigue and weakness in the elderly are the functional consequences of underlying neuromuscular decline. However, little is 
known about the manifestations of aging in the larynx. This study evaluated the manner in which laryngeal senescence affects 
laryngeal-respiratory kinematics by videorecording laryngeal motion in both young and old rats. Recorded images were digitized, 
and glottal displacement and movement rate were measured. The results indicated that the amplitude of change in glottal angle was 
significantly diminished, and laryngeal movement durations were prolonged in the old animals. These findings may be due to func- 
tional constraints on the respiratory system, impaired laryngeal-respiratory interactions, or decrements in vocal fold tension with 
age. Because of the serious and pervasive nature of dysphagia and communicative impairments in the elderly, research that specifi- 
cally examines the manifestations and causes of these impairments is of great importance. 


KEY WORDS — aging, dysphagia, dysphonia, laryngeal movement, respiration. 


INTRODUCTION 


Fatigue and weakness in elderly persons are the 
functional consequences of underlying neuromuscu- 
lar decline.!> In the aged, altered or disorganized 
patterns of muscular contraction are observed, which 
may be associated with deficits in movement kine- 
matics during target-oriented or complex actions.*-6 
In particular, neuromuscular processes underlying 
complex, natural movements for swallowing and 
voice production appear to be impaired in the elder- 
ly.”"!° Because many people older than 65 years of 
age have some type of communicative impairment 
and 6 to 15 million elderly Americans may suffer 
from dysphagia,!!-!> it is imperative to define kine- 
matic deficits associated with these functions with 
the future goal of designing effective interventions. 


The thyroarytenoid (TA) muscle in the larynx is 
active during phonation, quiet respiration, and the 
pharyngeal phase of the swallow to facilitate glottal 
closure for airway protection.!®!8 In elderly humans 
and in aged animal models, decrements in contrac- 
tion amplitudes and increased motor unit durations 
within the TA muscle have been found with increas- 


ing age.!??! These alterations in TA muscle physiol- 
ogy may translate into restrictions in laryngeal kine- 
matics during swallowing, respiration, and phona- 
tion in the aged. However, such laryngeal kinematic 
deficits have not been described. Knowledge of pu- 
tative changes in laryngeal kinematics with aging may 
contribute to an improved characterization of the 
functional consequences of neuromuscular decline 
within the intrinsic laryngeal muscles. Such knowl- 
edge is important in the eventual design of adequate 
interventions in humans with dysphagia and vocal 
impairment. 


The aim of this experiment was to evaluate the 
manner in which laryngeal senescence affects laryn- 
geal kinematics in a rat model. The hypothesis was 
that older animals would manifest reduced vocal fold 
movement amplitudes and movement rates relative 
to young adult controls because of altered laryngeal- 
respiratory dynamics. 


METHODS 


Animals. Laryngeal kinematics were examined in 
10 old (24 to 26 months old) and 10 young (10 





From the Division of Otolaryngology—Head and Neck Surgery, University of Wisconsin Medical School, Madison, Wisconsin. This study was 
performed in accordance with the PHS Policy on Humane Care and Use of Laboratory Animals, the NIH Guide for the Care and Use of 
Laboratory Animals, and the Animal Welfare Act (7 U.S.C. et seq.); the animal use protocol was approved by the Institutional Animal Care and 
Use Committee (IACUC) of the University of Wisconsin. This research was funded in part by a grant from the Medical Research Service of the 
Department of Veterans Affairs and by a University of Wisconsin Medical School grant. 


A preliminary version of this work was presented at the Combined Scientific Poster Session of the American Broncho-Esophagological Asso- 
ciation and American Laryngological Association, Orlando, Florida, May 13-14, 2000, and received first place in the poster competition. 


CORRESPONDENCE — Nadine P. Connor, PhD, University of Wisconsin Clinical Science Center, Room K4/711, 600 Highland Ave, Madison, 


WI 53792-7375. 


Suzuki et al, Laryngeal-Respiratory Kinematics in Aged Rats 685 


Fig 1. Vocal fold angles were 
measured independently for the 
left (L) and right (R) sides of the 
glottis. A) Maximum vocal fold 
opening. B) Maximum vocal fold 
closure. Three lines were drawn 
in each picture: line H is horizon- 
tal line through posterior commis- 
sure (point P), and lines R and L 
are from posterior commissure to 
vocal processes on right and left 
sides, respectively. AnglesR1, R2, 


months old) Fischer 344 rats. The average body 
weight of the old animals was 424 g (SD, 28.6 g), 
and the average body weight of the young animals 
was 458 g (SD, 111.7 g); the difference was not sta- 
tistically significant (p = .52). In a preliminary study 
of 12 young rats, the minimal level of anesthesia re- 
quired for microlaryngoscopic recording was deter- 
mined. The preliminary study is reported in the Ap- 


pendix, !7.22-24 


Videomicrolaryngoscopy. To allow microlaryngo- 
scopic visualization and recording, the rats were anes- 
thetized via an intraperitoneal injection of ketamine 
hydrochloride 90 mg/kg and xylazine hydrochloride 
9 mg/kg. This was previously determined to be the 
minimum dose of ketamine and xylazine that allowed 
insertion of a simple suspension microlaryngoscope 
(see Appendix). The animals were then placed into a 
malleable lead operating platform in a near-vertical 
position with the mouth secured open.?? A simple 
suspension microlaryngoscope was designed and fab- 
ricated from |-mm-diameter steel wire for visualizing 
the rat larynx. The microlaryngoscope was inserted 
into the oropharynx, and the position of the rat was 
adjusted to obtain a view of the larynx under a dis- 
section microscope fitted with a beam-splitter and a 
video camera (Zeiss, Thornwood, New York). 


Laryngeal-respiratory movements were recorded 
on video at 15, 20, 25, and 30 minutes after induc- 
tion of anesthesia. Two minutes of laryngeal motion 
during quiet respiration were recorded on videotape 
at 30 frames per second. The first 60 seconds of la- 
ryngeal movement were digitized with a Miro video- 
board and commercially available software (Adobe 
Premiere). 


Measurements. The laryngeal movement angles 
were measured on a personal computer with Scion 
Image analysis software to evaluate the change in 
glottal angle and laryngeal-respiratory durations in 
young and old animals. The measurements were 





H R2 -482 


L1, and L2 were measured with i O El 
Scion Image 1.62c software (Sci- 

on Corporation, Frederick, Mary- P 
land). 


made from the digitized glottal images by procedures 
similar to those we have reported previously.?* Spe- 
cifically, the first 10 glottal frames that demonstrated 
either maximum vocal fold closure (adduction) or 
maximum vocal fold opening (abduction) in each 60- 
second sample of laryngeal movement were mea- 
sured. All laryngeal angle measurements were made 
at the posterior commissure of the rat larynx, because 
the anterior commissure was obscured in all images. 
The following measurements were made for each 
sample of laryngeal-respiratory movement and are 
shown diagrammatically in Fig 1: 1) posterior glot- 
tal angle (degrees) of maximum vocal fold closure 
(adduction) and 2) posterior glottal angle (degrees) 
of maximum vocal fold opening (abduction). In ad- 
dition, a temporal measure was made of the average 
duration (in seconds) of an adduction-abduction cy- 
cle, across the first 20 seconds of digitized laryn- 
geal-respiratory movement. 


Two variables were then compared in the old and 
young groups: |) change in glottal angle (degrees), 
which was calculated as the change in posterior glot- 
tal angle between maximum abduction and maximum 
adduction, and 2) average movement duration (sec- 
onds) of an adduction-abduction cycle. 


RESULTS 


Comparisons were made in change in glottal angle 
and movement duration measures at 20 minutes af- 
ter induction of anesthesia in old versus young groups. 
As depicted in Figs 2 and 3, the amplitude of change 
in glottal angles was substantially diminished in the 
old animals (Wilcoxon rank sum test, p = .01). The 
average movement duration of an adductory-ab- 
ductory period was significantly longer in the old ani- 
mals, as shown in Fig 4 (Wilcoxon rank sum test; p 
= .005). 


Laryngeal movement data were also evaluated 15, 
20, 25, and 30 minutes after induction of anesthesia 
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Abduction 


Adduction 


Young 


Aged 


to investigate whether anesthesia level may differ- 
entially affect laryngeal-respiratory durations in old 
versus young animals. A linear relationship was as- 
sumed to exist between the log of time and the move- 
ment duration. The intercepts and slopes for the old 
and young animals were estimated by a least squares 
regression analysis. A likelihood ratio x? test was 
used to test for differences in these estimates across 
age groups. The results of these comparisons indi- 
cated that significant differences did not exist be- 
tween age groups in the slope of the relationship be- 
tween postanesthesia time and movement duration 
(p =.73). However, the intercepts were significantly 
higher in the old animals (p = .00001); this finding 
indicated that the glottal cycle duration was longer 
50 
45 
40 
35 
30 
25 


Change in Glottal Angle (degrees) 





Young Old 


Fig 3. Each circle represents average change in glottal 
angle for | rat. As evident from distribution, change in 
glottal angle was significantly reduced in old rats rela- 
tive to young adult rats (p = .01). 





Fig 2. Schematic diagram represents mean 
maximum abduction and adduction angles at 
posterior commissure in old and young rats. 
Standard deviations are shown as dashed lines. 
Change in glottal angle was diminished in old 
rats. 


in the old animals initially, but then followed the same 
pattern across time as that in the young animals dur- 
ing anesthesia recovery. 


DISCUSSION 


The goal of this study was to examine laryngeal- 
respiratory kinematics in rats, with the hypothesis that 
temporal and amplitude deficits in vocal fold move- 
ment would be manifested in older animals. Our re- 
sults supported this hypothesis. In the old rats, the 
amplitude of change in glottal angle was reduced, 
and movement durations were prolonged during quiet 
respiration. These findings may be indicative of func- 


3 


Movement Duration 
(sec) 





Young Old 


Fig 4. Each circle represents average duration for | com- 
plete glottal excursion for each rat. Movement duration 
of adduction-abduction periods was significantly longer 
in old rats than in young adult rats during quiet respira- 
tion (p = .005). 
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tional constraints on the respiratory system, impaired 
laryngeal-respiratory dynamics, or decrements in vo- 
cal fold tension with age. The relationship of these 
findings to underlying muscle activity, respiratory 
function, and associated anatomic changes must be 
determined to clarify the mechanism of these age- 
related differences. 


Within the larynx, a potential cause for reductions 
in laryngeal movement range and rate may be age- 
related atrophy of laryngeal muscle fibers. Because 
peripheral denervation-reinnervation has been postu- 
lated as a major cause of age-related changes in mus- 
cle structure and function in the limbs,?>:?6 it may be 
expected that laryngeal muscle would manifest pro- 
gressive denervation-like changes. This hypothesis 
must be studied directly via further physiological and 
morphological studies of senescent laryngeal muscle. 


A potential limitation of this study was that the 
kinematic measurements were made in anesthetized 
animals. However, in this animal model, the use of 
anesthesia was required for insertion of the laryngo- 
scope. Ketamine anesthesia may have depressed the 
central nervous system in our animals and reduced 
the respiratory rate. An alteration in respiratory rate 
would directly affect laryngeal kinematics during 
quiet respiration, and may have affected the results 
of this study. To minimize this effect, we performed 
a preliminary study in which the anesthesia level was 
varied (see Appendix). On the basis of the results of 
this preliminary study, we used the minimum required 
dosage for insertion of the microlaryngoscope. How- 
ever, the use of minimal anesthesia may not have been 
necessary, because the level of anesthesia did not ap- 
pear to have a significant effect on our measures. 
Specifically, it was demonstrated in the preliminary 
study (see Appendix) that change in glottal angle and 
movement duration were not significantly different 
across time as a function of anesthesia dosage in 
young animals. Furthermore, the anesthesia level did 
not appear to have a differential effect on our mea- 
sures across time in young versus old animals. We 
found that although old animals have longer laryn- 
geal movement durations at 15 minutes after anes- 
thesia than the young animals, the profile across time 





was not different between groups. That is, both groups 
of animals recovered from anesthesia with similar 
manifestations in laryngeal movement duration and 
change in glottal angle. These findings suggest that 
the anesthesia level was likely not a factor in the dif- 
ferences we observed between the age groups. 


The use of rats in research concerning the upper 
aerodigestive tract has been a relatively recent ad- 
vance.?2-24,27-31 The rat appears to be a good model 
for the study of aging laryngeal function and was 
chosen for use in this study because of our labora- 
tory’s technical expertise with rats, and a number of 
scientific considerations. The rat has been the most 
frequently used animal for examining age-related 
changes in muscle,32-33 so we could make a straight- 
forward comparison of laryngeal motor data with in- 
formation found abundantly in the literature derived 
via study of other muscle systems. Importantly, the 
relative magnitude of age-related muscle loss in ro- 
dents corresponds to that typically reported for hu- 
mans.?? As such, the rat model is relevant to changes 
observed in aging humans. 


With the microlaryngoscopic system developed in 
our laboratory, alterations in laryngeal function as a 
function of aging in a rat model can be easily re- 
corded. The fact that the rats in this study were only 
24 to 26 months old and still manifested age-related 
changes in laryngeal function is compelling and war- 
rants further investigation in older animals to deter- 
mine whether such changes are progressive. 


Because of the pervasive and life-threatening na- 
ture of swallowing deficits in the elderly, studies 
geared toward quantification of the underlying mani- 
festations appear crucial. Along with dysphagia, im- 
pairments in phonatory function are also reported in 
the elderly.?34.35 Underlying these vocal manifesta- 
tions may be reductions in electromyographic ampli- 
tudes!%.20 in the intrinsic laryngeal muscles, which 
likely contribute to restrictions in vocal fold move- 
ment range and increased vocal fold movement du- 
rations. However, these putative age-related laryn- 
geal motor restrictions have not been clearly demon- 
strated in elderly humans and should be studied fur- 
ther. 
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APPENDIX 


Before examining laryngeal kinematics in aged animals, we undertook a preliminary experiment with 14 young rats 
(10 months old) to determine the appropriate level of ketamine-xylazine anesthesia for use in microlaryngoscopic re- 
cordings. Because there have been inconsistent reports in the literature of thyroarytenoid muscle suppression during 
sleep and anesthesia in humans and some animal species, it was our goal to use the smallest level of anesthesia required 


for microlaryngoscopy.!7 


The 14 young rats were divided into 3 anesthesia dosage groups and were anesthetized via intraperitoneal injection. 
Two of the rats were measured on separate occasions after receiving 2 of the dosages. The groups were 5 animals that 
received ketamine hydrochloride 67.5 mg/kg and xylazine hydrochloride 6.75 mg/kg (group A), 6 animals that received 
ketamine hydrochloride 90.0 mg/kg and xylazine hydrochloride 9.0 mg/kg (group B), and 5 animals that received ketamine 
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hydrochloride 112.5 mg/kg and xylazine hydrochloride 11.25 mg/kg (group C). These dosages were chosen to bracket 


values used in our previous work with the rat larynx.??-*4 

For the lowest doses of ketamine and xylazine (group A), only 3 of the 5 rats achieved an anesthesia level consistent 
with visualization of the larynx. Although all group A animals were unresponsive to tail pinch stimulation at these 
dosages, a gag reflex was induced in 2 of the 5 rats upon microlaryngoscope insertion. Accordingly, dosage A was 
deemed too low for our purposes. For all animals in groups B and C, the anesthesia level was sufficient to allow 
microlaryngoscope insertion. Accordingly, the lower of the 2 dosages (B) was chosen for use in comparisons of laryn- 
geal movement characteristics in old versus young rats. 


Change in glottal angle and movement duration data were compared across time in the group B and C rats to allow 
assessment of the effect of anesthesia level in young animals on laryngeal-respiratory kinematics. A linear relationship 
was assumed to exist between either movement duration or change in glottal angle and the log of time. Intercepts and 
slopes were estimated with a least squares regression analysis. A mixed model with a random rat effect and a likelihood 
ratio x2 test were used to test for differences in these estimates between groups. The reciprocals of the parameter esti- 
mate variances were used as weights. The results indicated no differences in the intercepts or slopes for either of the 2 
variables (p > .37). Therefore, laryngeal movement duration and change in glottal angle values were not significantly 
different in the 2 anesthesia groups at 15 minutes after anesthesia, and then they followed roughly similar patterns across 
time during anesthesia recovery. Accordingly, it appeared that anesthesia dosage did not have a significant effect on 
laryngeal movement parameters. However, the lower of the 2 doses (dosé B) was used in comparisons of old versus 
young animals. 
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IDIOPATHIC SUBGLOTTIC STENOSIS REVISITED 


TULIO A. VALDEZ, MD STANLEY M. SHAPSHAY, MD 
BOSTON, MASSACHUSETTS 


Idiopathic subglottic stenosis (ISS) is a rare inflammatory process of unknown cause, usually limited to the subglottic region and 
the first 2 tracheal rings. We performed a retrospective analysis of our experience with a series of patients with this condition. The 
study involved retrospective review of the records of patients with ISS. A series of criteria that included patient clinical history, 
laboratory tests, flexible nasolaryngoscopic examination, and biopsies were used to establish a diagnosis. Symptoms, treatment, and 
outcome were examined. A total of 16 patients were identified; 14 were female (87.5%), and 2 were male. The mean follow-up time 
was 75.5 months. Fourteen patients required surgical treatment for respiratory compromise. Nine of these cases have been controlled 
effectively with endoscopic laser techniques. Endoscopic management failed in 5 patients, and 2 patients underwent laryngotracheal 
resection and reconstruction. Mitomycin-C was used in 6 patients as an adjuvant to endoscopic laser surgery. Patients in whom 
endoscopic treatment failed were noted to have thicker (>1 cm) and more complex stenoses. The diagnosis of ISS is a diagnosis of 
exclusion. A complete workup of the patient must be performed to rule out other causes of stenosis. Endoscopic laser surgery is a 
valid initial approach for thinner, noncomplicated lesions. Patients with thicker, complex lesions in whom endoscopic treatment fails 
are best managed with laryngotracheal resection and reconstruction. However, adjuvant use of mitomycin-C may prove beneficial in 
the treatment of these patients. Estrogen may play a role in the pathogenesis of ISS by altering the wound healing response. 


KEY WORDS 





airway stenosis, estrogen, laser treatment, mitomycin-C. 


INTRODUCTION sizing the diagnosis and treatment of this condition. 
Subglottic stenosis is most commonly the result A new hypothesis regarding the probable role of es- 
of mechanical trauma from endotracheal intubation. trogen will be presented and discussed. 


Other known causes of acquired subglottic stenosis 


in adults are Wegener’s granulomatosis, collagen vas- MATERIALS AND METHOHS 


cular diseases, external trauma, surgery, and upper A retrospective chart review of 16 patients was 
respiratory tract infections.! In a small number of performed. This series includes 6 previously unre- 
cases, however, the cause of subglottic stenosis can- ported cases and a retrospective analysis and follow- 
not be determined; hence the term idiopathic subglot- up of 10 previously reported cases.? Data were com- 
tic stenosis (ISS). piled with regard to gender, age at presentation, pre- 


vious endotracheal intubation or local trauma, GERD 
symptoms and treatment, previous airway infections, 
number and type of surgical procedures, and other 
possibly related systemic illnesses. Laryngoscopy 
and bronchoscopy under general anesthesia were per- 
formed in all patients to assess the extent of the le- 
sion and obtain a biopsy specimen. 


Idiopathic subglottic stenosis is a rare, slowly pro- 
gressive, nonspecific inflammatory process of un- 
known cause, usually limited to the subglottic re- 
gion and first 2 tracheal rings. It occurs predomi- 
nantly in women, usually between 30 to 50 years of 
age, with only a few cases reported in male patients.?3 


The pathogenesis of ISS remains speculative. Gas- 
troesophageal reflux disease (GERD), autoimmune 
diseases, and previous respiratory tract infections 
have all been proposed as possible causes. Addition- 
ally, the marked predominance for female patients 
gives rise to hypotheses suggesting a possible hor- 
monal component. However, various studies have re- 
ported a lack of estrogen receptors in these stenotic 


Histopathologic analysis to rule out neoplasm, 
granuloma, vasculitis, infection, and other systemic 
connective tissue disorders, as well as to test for es- 
trogen receptors, was performed on biopsy specimens 
from all patients. The patients were also tested for 
antineutrophil cytoplasmic antibodies (ANCA) to rule 
out Wegener’s granulomatosis. Intraoperative mea- 
‘ es surements of the length of the lesion were routinely 
siots, ang to date, no clear association has been performed with rigid ventilating bronchoscopes. All 
found.*4~ lesions were photographed before and after treatment. 

We report a series of 16 patients with ISS, empha- The percentage of stenosis was estimated in each case 
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PATIENT DATA 
Age at Length of No.of Preoperative Postoperative Follow-up 

Patient Sex Diagnosis(y) Lesion(mm) Dilations Stenosis (%) Stenosis (%) Outcome (mo) 
1 M 73 6 0 29 NA NA 52 
2 F 34 7 0 32 NA NA 128 
3 F 49 6 1 44 20 Favorable 90 
4 F 38 8 1 54 36 Favorable 136 
5 M 53 5 2 62 38 Favorable 187 
6 F 38 9 2 67 50 Favorable 82 
7 F 46 18 2 59 66* Failed treatment 78 
8 F 33 15 5 55 75* Failed treatment 88 
9 F 39 10 2 56 28 Favorable 64 
10 F 25 16 16 61 52 Failed treatment 84 
ll F 37 14 8 75 40 Failed treatment 40 
12 F 42 16° 1 64 46 Favorable 18 
13 F 40 13 1 62 55 Favorable 45 
14 F 40 19 2 48 54 Failed treatment 34 
15 F 13 7 I 72 28 Favorable 38 
16 F 32 4 1 56 29 Favorable 44 


NA — not applicable. 
*Underwent laryngotracheal resection and reconstruction. 





with a software program (Image Measure, Microsci- 
ence, Washington, DC) from previously scanned and 
digitized photographs. This percentage was obtained 
by calculating the total area of the airway and sub- 
tracting the area of the stenosis. 


Patients requiring operative procedures to improve 
the size of the airway were initially managed endo- 
scopically by a laser incision and dilation technique 
first described by the senior author (S.M.S.) in 1987.6 
Surgical exposure was obtained by suspension laryn- 
goscopy with the operating microscope set at 16x. 
Adequate exposure of the subglottis was obtained 
with the Dedo diagnostic anterior commissure—type 
laryngoscope (Pilling Co, Philadelphia, Pa). General 
intravenous anesthesia was used, and ventilation was 
maintained through a jet technique with a clip-on jet 
needle on the proximal posterior edge of the laryn- 
goscope. All patients received perioperative proton 
pump inhibitors and antibiotics. Radial incisions of 
the subglottis were performed with a carbon dioxide 
laser with a microspot micromanipulator. The inci- 
sions were made with superpulse mode setting of the 
laser with a short (0.2 second) exposure that limited 
heat conduction in the tissue. Ventilating rigid bron- 
choscopes of progressively larger diameters were 
then used for dilation of the subglottic stenotic seg- 
ment. The incisions served to guide the dilating forces 
in a controlled manner to preserve islands of viable 
epithelium between each incision. After the comple- 
tion of the incisions, adjuvant therapies to prevent 
restenosis were used: intralesional steroid injections 
in 8 patients and mitomycin-C in 6 patients. 


Cases that did not improve after 3 endoscopic pro- 





cedures were considered treatment failures, and pri- 
mary laryngotracheal resection and reconstruction as 
described by Grillo et alt was recommended. 


RESULTS 


Sixteen patients with ISS were observed in this 
study with a follow-up period ranging from 18 to 187 
months (see Table). The mean follow-up time was 
75.5 months. The diagnosis of ISS was made by ex- 
cluding known causes of subglottic stenosis, empha- 
sizing absence of recent endotracheal intubation, trau- 
ma, and airway infections. Fourteen of these patients 
were female. The age at presentation ranged from 
13 to 73 years. The mean age at presentation was 
36.1 years for female patients and 63 years for male 
patients. 


Fourteen patients presented with dyspnea as their 
main complaint. One patient was asymptomatic, and 
was found to have subglottic stenosis as an inciden- 
tal radiographic finding. The remaining patient pre- 
sented with stridor and a sporadic cough. 


A history of endotracheal intubation was found in 
9 of 16 patients. All of these procedures took place 
at least 4 years before the onset of symptoms. None 
of these patients had prolonged endotracheal intuba- 
tion. Three patients had a history of respiratory in- 
fections, ranging from 1 to 3 years before the onset 
of symptoms. 


Biopsy specimens from all patients showed dense 
fibrosis with sparse and nonspecific areas of inflam- 
mation. Tests for estrogen receptors and routine cul- 
tures for all biopsy specimens were negative. Cyto- 
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plasmic and perinuclear pattern antineutrophil cyto- 
plasmic antibodies were absent in the 13 patients 
tested. 


Fourteen patients required surgical procedures to 
improve the size of their airway because of the se- 
verity of their symptoms. A total of 45 incision and 
dilation procedures were performed in these patients. 


Eight patients were found to have long stenotic 
lesions (>I cm in length). Multiple endoscopic laser 
procedures failed in 5 of these patients. Two patients 
underwent laryngotracheal resection and anastomo- 
sis and are currently asymptomatic, without evidence 
of restenosis. One patient is scheduled to undergo 
laryngotracheal resection and anastomosis in the near 
future. The remaining 3 patients in whom endoscopic 
treatment failed have declined to undergo an open 
procedure and continue periodic incision and dila- 
tion with intraoperative mitomycin-C. A total of 37 
incision and dilation procedures were performed in 
this group of 8 patients with long stenotic lesions 
(average, 4.6 procedures). It is to be noted that pa- 
tients who were still symptomatic after 3 laser pro- 
cedures were offered treatment by open surgical re- 
section of the stenosis and reanastomosis. 


Eight patients presented with lesions of less than 
1 cm in length. Two patients had mild or no symp- 
toms and were managed conservatively (observa- 
tion). The remaining 6 patients in this group re- 
sponded favorably to endoscopic laser incision and 
dilation (average, 1.3 procedures) and are currently 
asymptomatic, with an adequate airway. 


The use of adjuvant therapy to prevent restenosis 
has changed during the course of the years. Eight 
patients from our previously reported series received 
intralesional steroid injections to prevent restenosis 
after the surgical procedure. No clear benefit was 
shown with this practice; thus, it has been abandoned. 


Eight procedures have been performed with mito- 
mycin-C. Four patients have maintained an increase 
in the size of the airway and remain asymptomatic 
after a single procedure with mitomycin-C. Two pa- 
tients have required a second procedure for further 
improvement of symptoms. One of these patients is 
scheduled to undergo laryngotracheal resection and 
anastomosis in the near future (patient 14). The other 
patient (patient 10) who required a second procedure 
with mitomycin-C has shown an increase in the size 
of her airway and is currently asymptomatic. 


Two patients who presented with reflux symptoms 
were found to have marked erythema in the poste- 
rior glottis and subglottic region. Neither pH probe 
studies nor barium swallow studies were performed 
in these 2 patients. Both patients were started on an- 


tireflux medication that resulted in control of their 
symptoms. One of these patients had improvement 
of airway symptoms that correlated to an increased 
airway lumen on examination after antireflux therapy. 
No changes in airway size were noted in the other 
patient treated for GERD. All patients received anti- 
reflux medication during the perioperative period, 
with a duration of treatment ranging from 7 to 21 
days. 


DISCUSSION 


Idiopathic subglottic stenosis is a diagnosis of ex- 
clusion. Only when all known causes have been ruled 
out by careful analysis of the patient’s history, physi- 
cal examination, and histopathologic and immuno- 
logic tests can this diagnosis be made. 


Patients with ISS may present with dyspnea, stri- 
dor, and even life-threatening airway obstruction. As 
these patients are otherwise healthy, they are often 
treated for asthma by their primary care physicians 
until they are referred for airway endoscopy or im- 
aging studies. Indeed, 3 patients in our series were 
initially treated for asthma before subglottic steno- 
sis was diagnosed. 


As ISS occurs predominantly in women, theories 
involving an estrogen effect have been proposed. 
Several authors, however, have failed to find estro- 
gen receptors in these lesions.24 Our data confirm 
this finding, since estrogen receptors were also ab- 
sent in our group of patients. Gastroesophageal re- 
flux also has been suggested both as an etiologic fac- 
tor and a contributing factor in ISS.57 Two of our 
patients presented with symptoms and findings at- 
tributed to GERD and consequently received anti- 
reflux medication (proton pump inhibitors). One of 
these patients showed significant improvement of the 
stenosis with long-term (greater than 3 months) anti- 
reflux medication. Other authors have reported simi- 
lar findings after initiating antireflux therapy in pa- 
tients with subglottic stenosis.° Although GERD does 
not explain the greater female incidence of this dis- 
ease, and the number of GERD patients in the gen- 
eral population is not proportional to the handful of 
patients with ISS, we believe that in some patients 
GERD may play a role both as the initial injury and 
as a chronically present aggravating factor. Whether 
all patients should be tested for reflux with pH probes 
or managed symptomatically with antireflux medi- 
cation remains a controversial topic. However, if 
used, pH probe studies should be performed with a 
double probe. Currently, our patients are managed 
with antireflux medication for | week before sur- 
gery and for 21 days after surgery to allow for ade- 
quate wound healing, and symptomatic patients re- 
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ceive a long course of proton pump inhibitors. 


Previously undiagnosed granulomatous diseases 
have also been described as underlying causes of 
ISS.8.9 Wegener’s granulomatosis limited to the lar- 
ynx and trachea has been well documented in the 
literature.!0-12 Tracheal and laryngeal compromise 
is not arare manifestation of Wegener’s granuloma- 
tosis, with a reported incidence ranging between 16% 
and 23%.10.11 In 1992, de Vries et al8 reported posi- 
tive ANCA in 6 patients without other signs of Wege- 
ner’s granulomatosis. Hoare et al!3 observed detect- 
able ANCA titers in patients with Wegener’s granu- 
lomatosis and subglottic stenosis. Since histologic 
analysis of biopsy specimens can be nonspecific in 
many cases of Wegener’s granulomatosis, ANCA ti- 
ter studies have been advocated as mandatory in pa- 
tients with subglottic stenosis of undetermined ori- 
gin. Ruling out Wegener’s granulomatosis is particu- 
larly important because of the potential systemic 
compromise that may ensue, and the good response 
it can have to medical therapy. 


The results of our study corroborate previous find- 
ings that established the prognostic factors for suc- 
cess or failure of endoscopic surgical procedures. !4 
Along with cartilaginous collapse, the vertical length 
of the lesion continues to be the single most impor- 
tant predictive factor for successful outcome after 
endoscopic laser surgery in cases of subglottic steno- 
sis. In our series, patients with ISS can be divided 
into 2 groups according to their response to treat- 
ment and the severity of the disease. The first group 
consists of patients with short (<1 cm in vertical 
length) and soft stenotic lesions usually associated 
with mild symptoms. Observation alone or 1 to 2 en- 
doscopic laser incision and dilation procedures usu- 
ally provides good results in these patients. 


The second group of patients has a more exten- 
sive type of stenosis. These patients usually have a 
long (1 to2 cm), complex, and disorganized scar for- 
mation better described as a cicatrix. These patients 
may show initial improvement with laser incision and 
dilation, but stenosis recurs gradually, usually over 
a period of 6 to 9 months. The laser incision is usu- 
ally uncomplicated. However, there is commonly 
some hypervascular tissue, particularly in the poste- 
rior aspect of the stenosis, which bleeds easily. This 
bleeding, albeit troublesome, particularly during Ven- 
turi jet ventilation, is controlled relatively easily with 
tamponade with a cottonoid soaked in 1/1,000 epi- 
nephrine. Our results have been poor with this type 
of lesion, with patients eventually requiring a defini- 
tive laryngotracheal resection and reconstruction. 
Grillo et alf reported excellent results with this tech- 
nique in a series of 35 patients with ISS who under- 


went resection and anastomosis, with 33 patients 
maintaining a good airway after the procedure. 


We recommend a conservative approach in those 
cases in which the patient presents for initial evalua- 
tion and treatment. A complete workup including air- 
way computed tomographic imaging, chest x-ray, 
ANCA titer assays, and pulmonary function studies 
is done before endoscopy under anesthesia. Preop- 
erative laryngotracheoscopy under topical anesthe- 
sia in the outpatient clinic gives valuable informa- 
tion regarding vocal cord mobility and the existence 
of airway collapse during respiration. Photographic 
documentation is performed in every case. Exami- 
nation with complete laryngoscopy and bronchos- 
copy is done under general anesthesia as described, 
and biopsy of the stenosis is done with histologic 
analysis and culture studies. A trial of endoscopic 
treatment is warranted in all cases before a consider- 
ation for an open resection of the stenosis and reanas- 
tomosis. In patients with reflux symptoms, antireflux 
medication should be started and the patient should 
be observed for improvement of symptoms before 
any surgical procedure. It takes at least 3 months for 
any improvement from such therapy. 


The use of topical mitomycin-C at the time of en- 
doscopic laser surgery may be beneficial in prevent- 
ing restenosis after surgery. Mitomycin-C is an anti- 
neoplastic agent that has been shown to modulate 
the wound healing response by inhibiting fibroblast 
proliferation. Both clinical and animal studies have 
shown promising results with this new adjuvant ther- 


apy in the treatment of tracheal and laryngeal steno- 
is, 15.16 
sis. 1> 


Four of 6 patients treated have maintained a stable 
airway after a single application of mitomycin-C, in- 
cluding 2 patients with long and complex stenotic 
lesions. One patient who had previously required la- 
ser incision and dilation procedures every 6 months 
has increased the interval between procedures to a 
year. This patient (patient 10) has refused open re- 
section and anastomosis surgery. 


It is important to note, however, that mitomycin- 
C does not halt the process of fibrosis completely 
and that some degree of restenosis usually occurs. A 
larger series of patients will be required to evaluate 
the adjuvant effect of mitomycin-C in the treatment 
of thick stenotic lesions. 


Despite the lack of estrogen receptors in the le- 
sions obtained from our group of patients, we be- 
lieve that estrogen plays a central role in the patho- 
genesis of ISS. Various reports have described the 
important role of estrogen in wound healing.!7.18 Es- 
trogen has been shown to increase the secretion of 
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transforming growth factor B1 (TGF-B1) from fibro- 
blasts. It is known that TGF-B1 promotes early ex- 
tracellular matrix production, fibrosis, and deposi- 
tion of collagen types I and III.'7:!9 Other wound heal- 
ing alterations, such as keloids, are characterized by 
fibroblast overproduction of these types of collagen 
secondary to an increase in TGF-B1. One study dem- 
onstrated that tamoxifen citrate, a synthetic, nonste- 
roidal antiestrogen, inhibits fibroblast proliferation 
and reduces collagen production by decreasing TGF- 
B production. !° 


Another mechanism through which estrogen influ- 
ences wound healing is by altering the inflamma- 
tory response. Estrogen has been shown to be a strong 
inhibitor of neutrophil chemotaxis.? Suppression of 
neutrophil accumulation results in reduced levels of 
neutrophil elastase, which is responsible in large part 
for fibronectin degradation. Increased levels of lo- 
cal fibronectin enhance fibroblast influx, collagen 
deposition, and wound contraction. Although in- 
creased collagen deposition and wound contraction 
are synonymous with an adequate wound healing re- 
sponse, they can be a major cause of concern in the 
airway. Since the subglottis and trachea are tubular 
structures, increased collagen deposition followed by 
wound contraction may indeed lead to airway steno- 
sis by approximation of the luminal surfaces. 


We believe that estrogen-influenced collagen dep- 
osition following a long-forgotten or undetected mu- 
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Schematic of possible role of estrogen-sensi- 
tive wound healing response in causation of 
idiopathic subglottic stenosis. TGF B — trans- 
forming growth factor B; GERD — gastro- 
esophageal reflux disease. 





Idiopathic 
Subglottic 
Stenosis 


cosal insult may be the cause behind ISS. Upper res- 
piratory tract infections and GERD may participate, 
both as initiating factors and as aggravating factors, 
by creating cyclical episodes of insult and inflamma- 
tion that enhance an estrogen-sensitive wound heal- 
ing response (see Figure). 


These interactions may indicate a role of estrogen 
in the pathogenesis of ISS despite the lack of detect- 
able estrogen receptors. The role of estrogen may 
provide an explanation for the overwhelming major- 
ity of ISS cases in young women, which has remained 
unexplained. Studies evaluating the role of estrogen 
in subglottic wound healing are currently under way 
to explore this hypothesis and possible therapeutic 
implications. 


CONCLUSIONS 


Idiopathic subglottic stenosis is a diagnosis of ex- 
clusion. A complete workup of the patient must be 
performed to rule out other causes of stenosis. En- 
doscopic laser surgery is a valid initial approach for 
thin, noncomplicated lesions. Patients with thick and 
complex lesions in whom endoscopic laser treatment 
has failed are best managed with laryngotracheal re- 
section and reconstruction. Adjuvant use of mitomy- 
cin-C may prove beneficial in the treatment of these 
patients. Estrogen may play an important role in the 
pathogenesis of ISS by altering the wound healing 
response. 
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The role of viridans group streptococci (Streptococcus oralis) in the prevention of colonization with nontypeable Haemophilus 
influenzae and Moraxella catarrhalis was investigated in an adenoid organ culture system. The adenoids from 100 patients who were 
undergoing adenoidectomy for either hypertrophy or recurrent otitis media were used. Streptococcus oralis Parker uniformly inhib- 
ited colonization with nontypeable H influenzae or M catarrhalis over a 24-hour period of incubation in adenoid organ culture. 
Streptococcus oralis Booth, a noninhibitory strain, did not significantly reduce colonization with nontypeable H influenzae and M 
catarrhalis. The results indicate that some strains of S oralis may inhibit colonization with potential pathogens in the nasopharynx. 
It is therefore possible that colonization with inhibitory strains of viridans streptococci may be used in the nasopharynx as a relatively 
safe and inexpensive approach to prevention of recurrent otitis media in some children. 


KEY WORDS — adenoid, bacterial interference, Moraxella catarrhalis, nontypeable Haemophilus influenzae, Streptococcus 


oralis. 
INTRODUCTION 


The phenomenon whereby one type of bacteria in- 
habiting a given host tissue prevents colonization 
with a second microorganism is known as bacterial 
interference.! Its application in the form of bacterio- 
prophylaxis has successfully stemmed epidemics of 
virulent staphylococci in infants,? has remedied 
chronic furunculosis,* and has prevented experimen- 
tal infections of burned laboratory animals.* Further- 
more, certain laboratory-derived and naturally occur- 
ring oral bacteria are promising effector strains for 
the replacement therapy of dental infectious diseases.5 
Possible explanations for interference include pro- 
duction of specific antagonistic factors such as bac- 
teriocins (bacterial polypeptides), production of hy- 
drogen peroxide, competition for nutrients,’ and se- 
lective alteration of the local physical-chemical en- 
vironment.8 Therapeutic intervention using the con- 
cept of bacterial interference has been performed by 
a number of clinical investigators in Sweden. The 
use of &-hemolytic streptococci, which are members 
of the normal commensal flora of the oral cavity and 
nasopharynx, has been successful in the treatment of 
epidemic B-hemolytic streptococcal pharyngitis? and, 
most recently, in the treatment of otitis-prone chil- 


dren.' Our laboratory has previously described the 
microbial ecology of the nasopharyngeal bacterial 
flora in otitis-prone and non-otitis-prone children. 11-12 
We recently studied 2 strains of Streptococcus oralis. 
One strain (Parker) inhibited all nontypeable Hae- 
mophilus influenzae (NTHi) that were tested.!3 The 
second strain (Booth) did not inhibit any strains of 
NTHi in vitro.!3 The major difference detected be- 
tween the strains was the substantially faster rate of 
growth of the Parker strain under both aerobic and 
anaerobic conditions. ! The present study extends our 
previous findings by means of an adenoidal organ 
culture system in which the kinetics of bacterial in- 
terference were quantitatively studied. 


MATERIALS AND METHODS 
STUDY POPULATION 


We obtained adenoidal tissue from 100 children 
with a history of otitis media (either recurrent acute 
otitis media or otitis media with effusion; n = 40) or 
with adenoidal and/or tonsillar hypertrophy (n = 60). 
There was no selection with regard to day-care atten- 
dance or allergy. None of the children had clinical 
signs of acute infection at the time of adenoidectomy. 
Informed consent was obtained from parents before 
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CONDITIONS FOR STUDY OF BACTERIAL INTERFERENCE USING STREPTOCOCCUS ORALIS PARKER OR 
STREPTOCOCCUS ORALIS BOOTH 


Moraxella catarrhalis as Target 




















NTHi as Target 
NTHi and S oralis Parker versus NTHi and § oralis Booth 











M catarrhalis and S oralis Parker versus M catarrhalis and 
S oralis Booth 


M catarrhalis and S oralis Parker versus M catarrhalis alone 


M catarrhalis and S oralis Parker versus M catarrhalis and 
adenoidal filtrate (M catarrhalis) 


M catarrhalis and S oralis Booth versus M catarrhalis alone 


M catarrhalis and S oralis Booth versus M catarrhalis and 
adenoidal filtrate (M catarrhalis) 


NTHi and $ oralis Parker versus NTHi alone 


NTHi and S oralis Parker versus NTHi and adenoidal 
filtrate (NTHi) 


NTHi and S oralis Booth versus NTHi alone 


NTHi and S oralis Booth versus NTHi and adenoidal 
filtrate (NTHi) 








NTHi — nontypeable Haemophilus influenzae. 














surgery. All sampling for bacteriology was performed 
by one of the authors (J.M.B.). None of the children 
had taken antibiotics for 3 weeks before surgery. The 
adenoids were used in 10 different experimental con- 
ditions (see Table). 


BACTERIOLOGIC INTERFERENCE STUDIES 


Experiment 1. The fresh adenoid tissue was de- 
contaminated by incubation overnight at room tem- 
perature in transport medium containing vancomy- 
cin hydrochloride, amphotericin B, and colistin sul- 
fate. The tissue was then rinsed in sterile saline solu- 
tion and cut into 6 equal-sized pieces measuring about 
3 mm. The pieces of tissue were placed in separate 
wells of a tissue culture tray. Minimal essential me- 
dium was added to each well to prevent drying of 
the tissue, but not to cover the surface of the tissue. 
We placed 20 uL of a 0.5—McFarland unit suspen- 
sion of NTHi or Moraxella catarrhalis on the sur- 
face of each piece of adenoid tissue and incubated 
them in 5% to 7% carbon dioxide (CO2) at 36°C. At 
0, 2, 4, 6, 8, and 24 hours, the adenoid tissue was 
homogenized and 1 uL was spread onto trypticase 
soy agar (TSA) with 5% sheep blood and chocolate 
agar plates. The plates were incubated in 5% CO2 at 
36°C for 24 hours. The number of colony-forming 
units (CFU) per milliliter was counted and identi- 
fied by conventional bacteriologic procedures. 


Experiment 2. The adenoidal tissue was prepared 
as before. We placed 20 uL of a 0.5—McFarland unit 
suspension of § oralis (isolated from either patient 
Booth or patient Parker) on the surface of the ade- 
noid tissue and incubated it in 5% to 7% CO2 at 36°C 
for 1 hour. We placed 20 uL of a 0.5—McFarland unit 
suspension of NTHi or M catarrhalis on the surface 
of each piece of adenoid tissue and incubated it in 
5% to 7% CO2 at 36°C. Inoculation with NTHi or M 
catarrhalis was performed 1 hour after the inocula- 
tion with S oralis. The plates were incubated in 5% 
to 7% CO2 at 36°C for 24 hours. At 0, 2, 4, 6, 8, and 
24 hours, the adenoid tissue was homogenized and 1 
HL of the homogenate was inoculated onto TSA with 


5% sheep blood and chocolate agar plates. The colo- 
nies were quantified as the number of CFU per mil- 
liliter and identified by conventional bacteriologic 
procedures. 


Experiment 3. The fresh adenoid tissue was steril- 
ized in transport medium containing vancomycin, 


-amphotericin B, and colistin. After overnight steril- 


ization at room temperature, the tissues were rinsed 
in sterile saline solution and cut into 6 equal-sized 
pieces. The pieces of tissue were placed in 5 mL of 
sterile saline solution and incubated at room temper- 
ature for 4 hours. A filtrate of the adenoid pieces was 
made by emulsification and passage through a 25- 
mL, 0.45-um filter. We added NTHi and M catar- 
rhalis to the sterile adenoid filtrates. We placed 20 
uL of a 0.5—McFarland unit suspension of NTHi or 
M catarrhalis in the adenoid filtrate on the surface 
of each piece of adenoidal tissue and incubated it in 
5% to 7% CO2 at 36°C. At 0, 2, 4, 6, 8 and 24 hours, 
1 pL of homogenate was placed onto TSA with 5% 
sheep blood and chocolate agar plates. The plates were 
incubated in 5% to 7% CO2 at 36°C for 24 hours. 
The colonies were quantified as the number of CFU 
per milliliter and identified by conventional bacterio- 
logic procedures. 


Finally, the four bacteria were also incubated with 
saline solution alone and cultured at 0, 2, 4, 6, 8, and 
24 hours to determine their viability. The effects of 
all of these controls on the colonization of NTHi and 
M catarrhalis proved not to be significant and there- 
fore are not included in the results reported below. 


STATISTICAL METHODS 


Comparisons of the effects of S oralis Parker and 
S oralis Booth on colonization with NTHi or M catar- 
rhalis were made with a repeated or multiple analy- 
sis of variance.!4 This model enabled evaluation of 
the effects of each medium, the effect of time, and, 
primarily, the effect of time on any differences in the 
effects of media (an interactive effect). The statisti- 
cal procedure was performed on 10 pairwise com- 
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parisons that are shown in the Table. Statistical sig- 
nificance was set at .05. This Bonferroni correction 
provided an overall type 1 error rate of 5%.!4 


RESULTS 


The Figure, A, demonstrates the effect of S$ oralis 
Parker on the growth of NTHi over a 24-hour peri- 
od. The NTHi alone in the adenoid organ culture per- 
sisted for 24 hours. It maintained more than 104 CFU/ 
mL for the first 8 hours before declining to 10 CFU/ 
mL by 24 hours. In contrast, the inhibitory strain of 
S oralis Parker significantly decreased the growth of 
NTHi in the adenoid homogenate over time. The in- 
hibitory effect appeared by as early as 2 hours. By 6 
hours, NTHi had declined to 10 CFU/mL, and by 24 
hours, NTHi was not recovered. This change was 
highly significant (p < .0001). 
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Results in adenoidal organ culture over time. CFU — colony- 
forming units; NTHi — nontypeable Haemophilus influenzae; 
Parker — Streptococcus oralis Parker; Booth — S oralis Booth; 
M cat. — Moraxella catarrhalis. A) Effect of Parker on coloni- 
zation of NTHi. Result is highly significant as compared to 
growth of NTHi alone. B) Effect of Booth on colonization of 
NTHi. There is no significant effect on growth of NTHi in com- 
parison to NTHi alone in adenoid organ culture. C) Effect of 
incubation of NTHi with sterile adenoid filtrate has no effect 
on colonization of NTHi over 24-hour period. In contrast, there 
is marked effect on growth by Parker. D) Effect of Parker or 
Booth on growth of M cat. E) Effect of Parker on M cat. versus 
M cat. alone. Results approach significance. 


The Figure, B, demonstrates the effect of the non- 
inhibitory strain, S oralis Booth, on the growth of 
NTHi. There was little effect of this organism on the 
growth of NTHi over time in the adenoid organ cul- 
ture. 


The effect of the incubation of NTHi with an ade- 
noidal filtrate is shown in the Figure, C. In compari- 
son to the dramatic effect of S oralis Parker on the 
inhibition of colonization of NTHi, there is no effect 
of the adenoidal filtrate alone on this organism. 


The effect of the inhibitory S oralis Parker on M 
catarrhalis versus that of the noninhibitory S oralis 
Booth on M catarrhalis is demonstrated in the Fig- 
ure, D. There was a significant effect of S oralis Par- 
ker on the growth of M catarrhalis in comparison to 
the noninhibitory § oralis Booth. Finally, the effect 
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of S oralis Parker on the growth of M catarrhalis 
(see Figure, E) did not reach significance, but the 
trend toward inhibition of colonization was evident. 


DISCUSSION 


The emergence of antibiotic-resistant bacteria that 
cause acute otitis media in the past 2 decades has 
made the search for alternative strategies for the pre- 
vention of bacterial upper respiratory tract infections 
critical. Nasal mucosal vaccinations with potential 
pathogens to produce specific secretory IgA in na- 
sopharyngeal secretions against these bacteria; the 
potential use of carbohydrate analogs as a method to 
block bacterial adhesion receptors on the nasopha- 
ryngeal mucosa; and bacterial interference are three 
potential new approaches to the prevention of infec- 
tion in otitis-prone children. All of the above strate- 
gies have one thing in common: the inhibition of colo- 
nization of potential pathogens. The present study 
demonstrates that inhibitory strains of commensal 
bacteria such as S oralis are capable of inhibiting 
the colonization of potential pathogens on the sur- 
face of the adenoid. The bacterial interference ex- 
periments reported here demonstrate a statistically 
significant decrease in the colonization of NTHi and 
a similar trend against M catarrhalis in an adenoid 
organ culture system. Since adenoids possess lym- 
phoid tissue that has the capability of synthesizing 
specific secretory IgA, as well as specific IgG, it was 
important to rule out the possibility that the adenoid 
itself was not responsible for the decrease in coloni- 
zation of NTHi or M catarrhalis. However, the fil- 
trate of adenoid organ culture with NTHi had no ef- 
fect on the colonization of either NTHi or M catar- 
rhalis in an adenoid organ culture over a 24-hour 
period. Therefore, it is unlikely that the adenoid tis- 
sue played a role in inhibiting the colonization of 
the bacteria. Furthermore, the fact that incubation of 
the bacteria in saline solution did not result in loss of 
viability demonstrates that the saline solution had 
no effect on the decrease in growth of the two patho- 
gens seen after exposure to the inhibitory strain of S 
oralis. 


Previous studies from our laboratory have demon- 
strated that S oralis Parker was inhibitory to most 
strains of NTHi and M catarrhalis.!3 It is therefore 
likely that the inhibition of colonization was related 
to the presence of the $ oralis, and not due to any 
intrinsic immunologic effect of the adenoid itself: 
Again, previous experiments from our laboratory 
have shown that sterile filtrates of the a-hemolytic 
Streptococcus Parker had no effect on the survival 
of NTHi.}3 Only intact S oralis was capable of inhib- 
iting NTHi, M catarrhalis, and Streptococcus pneu- 
moniae. We concluded in those experiments that al- 


teration of the local pH and possibly the utilization 
of nutrients were the most likely micro-environmen- 
tal factors responsible for the inhibition of growth of 
the pathogens studied.'3 


Several laboratories have demonstrated that the 
development of both acute otitis media and acute ton- 
sillitis can be altered by the microbial ecology of the 
nasopharynx and oral pharynx, respectively.?!° Com- 
mensal organisms, particularly o-hemolytic strepto- 
cocci in both the mouth and nasopharynx, may in- 
hibit the colonization of potential pathogens such as 
S pneumoniae, NTHi, and M catarrhalis, as well as 
B-hemolytic streptococci. 


Recent investigations by Tano et al!5:!6 demon- 
strated that a-hemolytic Streptococcus removed, par- 
ticularly, from the eustachian tube orifice in healthy 
children significantly decreases the colonization of 
S pneumoniae, NTHi, and M catarrhalis in an agar 
overlay technique. These authors suggested that the 
normal flora might be able to prevent new episodes 
of acute otitis media. However, the mechanisms of 
this inhibition were not identified. 


The most recent investigation by Roos et al!® dem- 
onstrated a highly significant inhibition of nasopha- 
ryngeal pathogens in otitis-prone children treated with 
a nasal spray consisting of a combination of inhibi- 
tory o-hemolytic streptococci including Streptococ- 
cus mitis, Streptococcus sanguis, and S oralis. Those 
children who received this combination of &-hemoly- 
tic streptococci in a nasal spray had a significantly 
reduced recurrence rate of otitis media after appro- 
priate antibiotic therapy as compared to the children 
who did not receive the nasal spray. The authors con- 
cluded that a-hemolytic streptococci given as a na- 
sopharyngeal spray after antibiotic treatment were 
well tolerated and decreased the recurrence rate in 
patients with recurrent acute otitis media. 


Bacterial interference is not a new concept; it has 
been studied for more than a century. Pasteur stud- 
ied the effect of normal commensal gastrointestinal 
bacteria on the anthrax bacillus in cows.!7 Subse- 
quently, there has been very active research on the 
effect of in vitro and in vivo bacterial interference in 
the nasopharynx of children with otitis media follow- 
ing antibiotic therapy.!8 Bacterial interference among 
nasal inhabitants, with the eradication of Staphylo- 
coccus aureus from nasal cavities by the artificial im- 
plantation of Corynebacterium species, has also been 
studied.!9 Finally, the role of Lactobacillus in the pre- 
vention of urogenital and intestinal infections has 
been thoroughly studied by a number of investiga- 
tors.2° Thus, the use of S oralis in the nasopharynx 
is a reasonable, relatively safe, and inexpensive ap- 
proach to the treatment of otitis-prone children. 
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SURGERY OF THE NASAL COLUMELLA IN EXTERNAL VALVE 
COLLAPSE 


ANGELO GHIDINI, MD 
STEFANO DALLARI, MD DANIELE MARCHIONI, MD 
MODENA, ITALY 


The authors describe and discuss their experience with the collapse of the external nasal valve, focusing on the role that surgery 
of the columella plays in solving this problem. Thirty-seven cases were treated. In all but | there had been a previous septorhinoplasty. 
Seven patients had concomitant internal valve collapse. A modified alarplasty associated with columelloplasty was adopted in the 
cases with severe collapse. When the reduction of the nostril opening was mild to moderate, the surgical strategy varied depending on 
the width of the columella base: alarplasty in cases with a narrow to normal columella, and columelloplasty in the presence of a wide 
columella base. The authors’ philosophy tends to minimize intervention on the nasal valve areas. Their results demonstrate that when 
the collapse and its functional effect are not too severe, a good result can be achieved by working only on the columella, especially 
if its base is significantly wide. 


KEY WORDS — alarplasty, columelloplasty, external nasal valve collapse. 


INTRODUCTION columella, whose width can play an important role 


The difference between the external nasal valve in inducing disturbances in nasal respiration. 


and the internal nasal valve has always been a matter 


of controversy. What we call the internal valve is the MATERIAL AND METHODS 

spatial relationship between the caudal edges of the Twenty-seven patients were studied, 26 of whom 
upper lateral (triangular) cartilages and the septum, showed postseptorhinoplasty external valve collapse. 
whereas the external valve is determined by the in- Seven of the 27 had concomitant internal valve col- 
ferior rim of the lateral crura of the lower lateral (alar) lapse due to scarring and triangular collapse. 


cartilages, as it relates to the floor of the vestibulum 


And the columella The entity of external valve collapse was deter- 


mined by endoscopy and during calm or slightly 


Disorders of the nasal valve are sometimes en- forced breathing. The columella was also examined, 
countered in nonoperated noses, but generally follow and its width was related to the referred nasal breath- 
a somewhat suboptimal septorhinoplasty. Resection ing difficulty (Fig 1). The anomaly mostly occurs at 
of large portions of the lateral crura and the related the basal third of the columella, in which there is an 
scarring phenomena can produce both external valve excessive widening and flattening of the columella 
collapse and internal valve collapse, since the upper footplates. In fact, an overly wide columella is un- 
lateral cartilages are no longer supported.6-8 common, and much more difficult to correct. 

We report our experience with cases of external The evaluations suggested the surgical strategy 
valve collapse, highlighting the importance of the (Fig 2). Severe external valve collapse was treated 


Fig 1. External nasal valve collapse and colu- 
mella width. A) Severe collapse, normal colu- 
mella. B) Mild to moderate collapse, large colu- 
mella. C) Mild to moderate collapse, narrow 
columella. 
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EXTERNAL VALVE COLLAPSE 
(27 cases; 7/27 with concomitant IVC) 


severe mild-moderate 
(nostril opening reduction >50% : 4 cases) (nostril opening reduction <50% : 23 cases) 


ALARPLASTY +COLUMELLOPLASTY 


(normal or narrow (large columella: 
columelia:10 cases) 13 cases) 


ALARPLASTY COLUMELLOPLASTY 


(+BONE MODELLING) 


Fig 2. Description of cases and surgical strategy. IVC — 
internal valve collapse. 


through a combination of alarplasty (according to 
Goode? and Toriumi et al!°) and columelloplasty. 
Mild to moderate external valve collapse was treated 
with either columelloplasty (13 cases with an evident 
large columella) or alarplasty (10 cases with a normal 
to narrow columella). 


Our alarplasty technique is a combination of those 
of Goode? and Toriumi et al! and is based on the 
application of autologous cartilage inserts (nasal or 
conchal) into a pocket created by either an endonasal 
or an open approach (Fig 3). 


The columelloplasty!! is performed via 2 incisions 
along the caudal edges of the medial alar crura, dis- 
section and removal of soft tissue, section of the me- 
dial crura “feet,” and their reapproximation with su- 
tures (Fig 4). Section of the basal third of the medial 
crura is necessary to achieve a complete and persis- 
tent reduction of the base of the columella and to 
avoid an undesired augmentation of its length. 


In 6 of the 27 cases, we added resection of the 





Fig 3. Alarplasty (see explanations in text). 











Fig 4. Columelloplasty (see explanations in text). 


lateral tip of the bony nasal spine and flattening of 
the pyriform crests when their hypertrophy was con- 
tributing significantly to nasal bulging. We prefer 2 
large incisions, because the unilateral approach makes 
exposure of the contralateral footplate more difficult; 
2 incisions are even more necessary when a trimming 
of the nasal spine is planned. 


The degree of nasal obstruction was assessed 
through subjective (self-administered questionnaire) 
and objective (rhinorheomanometric) criteria. Before 
operation, 7 patients had complete obstruction (grade 
3), 12 patients had incomplete, persistent obstruction 
(grade 2), and 8 patients had incomplete, nonconstant 
obstruction (grade 1). 


RESULTS 


The postoperative results are shown in Table 1, 
which relates the functional outcome to the preopera- 
tive status and the surgical technique adopted. The 
typical appearance of the postoperative result is shown 
in Fig 5. 


TABLE 1. SURGICAL RESULTS 


Alarplasty + 
Columelloplasty  Alarplasty | Columelloplasty 














(n=4) (n= 10) (n= 13) 
Grade Preop Postop Preop Postop Preop  Postop 
0 3 6 11 
l 6 4 7 2 
2 l 4 6 
3 4 


Grade 0 — no subjective functional impairment; grade 3 — severe 
subjective functional impairment. 
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Fig 5. How columella changes. Typical postop- 
erative result in case of large columella with con- 
comitant pyriform crest hypertrophy. 


All patients reported a functional improvement of 
the nasal obstruction. The 7 patients with incomplete 
results were the ones with concomitant internal nasal 
valve scarring. The rhinorheomanometric data corre- 
lated with the subjective results; a flow-rate increase 
was noticed during forced respiration (Table 25-6). 


DISCUSSION 


The importance of lateral nasal wall stability for 
good respiration through the nose is well known. 
Thus, patients referred with nasal disturbances should 
be examined with attention to the nasal valve ele- 
ments, as well as to the movements of the lateral nasal 
wall and the tactile consistency of the nasal alae. 


Our experience shows that in cases of severe alar 


TABLE 2. RHINOMANOMETRIC DATA 











Inspiratory Mean +SEM 
Effort (mL) Pp 
Preoperative Quiet 84.96 + 10.93 


Forced 316.15+57.35 <.001 
Postoperative Quiet 198.04 + 46.19 <.001 
Forced 422.19 + 46.45 


Postoperative functional results in 27 patients treated with alarplasty 
and columelloplasty (according to Constantian’s evaluation>-®), 








collapse, the complete and optimal reconstruction of 
the alar arch is mandatory for achieving good func- 
tional results. On the other hand, for mild collapses, 
the correction of a large columellar base (possibly 
integrated with bony spine and/or pyriform crest re- 
section) leads to results that are equal to or even better 
than those obtained through an alarplasty. Alarplasty, 
moreover, can sometimes be difficult to perform and 
may produce an unpleasant thickening of the supra- 
alar region. 
CONCLUSIONS 


Our approach has always been to minimize subse- 
quent reoperations on the valve area. All of the pro- 
posed techniques, in fact, show “costs” in terms of 
scarring retractions and adhesions. 


In this report, we aim to demonstrate that good 
functional results can be achieved with minimally 
invasive interventions in many cases of valve insuf- 
ficiency, even if there is a certain amount of defici- 
ency in support of the lateral nasal wall, and especial- 
ly if there is a concomitant large columella and aes- 
thetic corrections are not needed. 

Needless to say, a thorough knowledge of the vari- 


ous techniques is required in order to tailor the surgi- 
cal choice to the individual case. 
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COMPARISON OF SURGICAL OUTCOMES IN PRIMARY 
ENDOSCOPIC DACRYOCYSTORHINOSTOMY WITH AND WITHOUT 
SILICONE INTUBATION 
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The surgical outcome of endoscopic endonasal dacryocystorhinostomy was analyzed in 30 cases of postsaccal stenosis. Intubation 
with silicone tubing was used in 14 cases (46.7%) and not used in 16 cases (53.3%). Surgical success was evaluated subjectively and 
objectively. The patients’ complaints were improved in 85.7% of cases in the intubation group, and in 81.3% of the group in which no 
stent was used. Postoperative endoscopic examinations revealed that the rhinostomy opening was visible in 11 cases without intubation 
(68.8%) and 9 cases with intubation (64.3%). Six patients in the intubation group (42.9%) had granulation tissue at the rhinostomy 
site. Four patients (28.6%) had complaints regarding the intubation. Considering the similar surgical success rates, and the granulation 
formation, patient discomfort, and cost related to intubation, we recommend endoscopic dacryocystorhinostomy without intubation 
as the treatment of choice in cases of chronic epiphora due to postsaccal stenosis of the lacrimal canal. 


KEY WORDS — dacryocystorhinostomy, endoscopic dacryocystorhinostomy, lacrimal surgery, silicone intubation. 


INTRODUCTION 


With the introduction of high-resolution endo- 
scopes for paranasal surgery, endoscopic endonasal 
dacryocystorhinostomy (En-DCR) has begun to gain 
popularity.!-* It has been proposed as an alternative 
surgery to the external DCR (Ex-DCR) operation in 
cases of chronic epiphora due to stenosis of the naso- 
lacrimal duct.* 


Closure of the rhinostomy opening was considered 
a main factor in surgical failure in Ex-DCR opera- 
tions.?:!° In Ex-DCR, several methods, such as use 
of silicone tubing, application of mitomycin-C to the 
rhinostomy opening, and suturing of the mucosal 
flaps, have been suggested for providing a permanent 
rhinostomy opening after completion of mucosal 
healing.’:!0-!2 However, because application of mito- 
mycin-C is controversial and no mucosal flaps are 
created in En-DCR, insertion of silicone tubing into 
the lacrimal duct is the most commonly preferred 
procedure. It has been claimed that silicone tubing 
would improve surgical outcomes of En-DCR.*> Our 
experience with En-DCR without silicone intubation 
revealed that the surgical success rate was about 90% 
in primary cases — similar to the rate in several re- 
ports from different clinics.’ This high success rate 
raises the question of to what extent additional benefit 
can be obtained by using silicone tubing. In a litera- 
ture survey, we failed to find a detailed study com- 
paring the surgical results of procedures with silicone 
intubation versus those of procedures with no intu- 


bation in En-DCR patients. Furthermore, some re- 
ports indicate that silicone tubing itself might have 
been a cause of surgical failure.!3!4 Therefore, the 
aim of this study was to compare the surgical out- 
comes of En-DCR with and without insertion of sili- 
cone tubing. 


PATIENTS AND METHODS 


Thirty-seven consecutive patients who had no pre- 
saccal stenosis, no previous lacrimal surgery, no his- 
tory of trauma, and no suspicion of malignancy un- 
derwent En-DCR for moderate to severe epiphora 
from November 1995 to December 1999 at the Celal 
Bayar University otorhinolaryngology and ophthal- 
mology departments. Of these, 25 patients (67.6%) 
who had a follow-up duration of more than 4 months 
were included in the analysis of the results. Five pa- 
tients had bilateral surgery. Therefore, a total of 30 
cases were included in the study. The data regarding 
the patient’s age, sex, date of operation, type of sur- 
gery (with or without silicone intubation), and side 
of operation were examined. 


All patients underwent complete preoperative oph- 
thalmologic and endoscopic nasal examination. Na- 
solacrimal obstruction was investigated by lacrimal 
irrigation. All patients demonstrated a resistance to 
the flow of saline solution, with regurgitation through 
the opposite punctum. Active transport dacryocys- 
tography was performed for each case to determine 
the site of obstruction as described in earlier reports. !5 
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Fig 1. Postoperative endoscopic visualization after endo- 
scopic endonasal dacryocystorhinostomy (right side). 
mt — middle turbinate, S — nasal septum, arrow — lac- 
rimal opening. 


Only patients with postsaccal stenosis and who had 
normal or dilated lacrimal sacs underwent operation. 


All procedures were performed by a joint team in- 
cluding the same otorhinolaryngologist (H.H.U.) and 
the referring ophthalmologist. The majority of the pa- 
tients who underwent operation between November 
1995 and April 1998 were assigned not to receive 
silicone intubation. The majority of the patients op- 
erated on between May 1998 and December 1999 
were assigned to receive silicone intubation. These 
two groups of patients were homogeneous regard- 
ing the preoperative clinical findings. 


The surgical technique of En-DCR was described 
in detail elsewhere.’ It is summarized as follows. Na- 
sal mucosal decongestion is achieved with an injec- 
tion of 1% lidocaine hydrochloride with epinephrine 
bitartrate, then with nasal packing soaked in a 4% 
cocaine hydrochloride solution. A posterior incision 
is placed 5 mm and 10 mm anterior to the middle 
turbinate attachment and the free border of the unci- 
nate process, respectively. A rectangular mucosal win- 
dow 5 x 10 mm in dimension is created and removed. 
The lacrimal bone is removed with a chisel and ham- 
mer. The medial sac wall is tented up with a probe. A 
vertical incision is made on the wall of the sac, and 
part of the medial wall approximately 5 mm in diam- 
eter is resected. The lacrimal canal is irrigated with 
saline solution. When silicone intubation is per- 
formed, the tubes are cut free from stainless steel 
probes and are knotted together. The tubes are fur- 
ther tied with a 4-0 silk suture. 


All patients were discharged on a 10-day regimen 
of oral antibiotic therapy and nasal steroid spray. The 
patients were given topical antibiotic (tobramycin 
sulfate 0.3%) and steroid (fluorometholone acetate 
0.1%) eyedrops | or 2 drops 4 times a day for 10 
days. 

The patients without silicone intubation were ex- 
amined once a week in the first month after opera- 
tion. Fibrin clots and crusts were removed by endos- 
copy during the postoperative visits. Thereafter, the 
examinations were made every 15 days until mucosal 
healing was complete. The patients with silicone in- 
tubation were examined at the end of the first post- 
operative week. The silicone tubes were removed in 
the eighth postoperative week. The presence of sili- 
cone tubing impeded us from doing a thorough endo- 
scopic cleaning of the rhinostomy site in this group 
of patients during the first postoperative 8-week pe- 
riod. Follow-up examinations were conducted every 
2 months for both groups of patients. 


The surgical outcome was evaluated both subjec- 
tively and objectively. In the subjective assessment, 
the patients were asked to grade the degree of epiph- 
ora relief on a 5-point scale: a score of | for symp- 
tom-free, a score of 2 for significantly improved, a 
score of 3 for slightly improved, a score of 4 for no 
improvement, and a score of 5 for worse. Any decla- 
ration of improvement by the patient (1, 2, or 3) was 
considered as success. They were asked whether they 
were disturbed by the silicone tubes, and whether they 
were disturbed by the frequent postoperative nasal 
examinations. 


The objective assessment examined the visibility 
of the rhinostomy opening (Fig 1), the presence of 
granulation tissue at the rhinostomy opening, and 
whether any fluorescein dye was observed in the na- 
sal cavity after instillation of a solution of 2% fluo- 
rescein to the eye, either spontaneously after a few 
blinks or after pressure was applied over the lacrimal 
sac. Also, the performance of additional surgery and 
the presence of intranasal synechiae were recorded. 


The group means were compared with Student’s 
t-test. The success rates and other categorical vari- 
ables were tested with the Pearson y? test and Fisher’s 
exact test. We regarded p values smaller than .05 as 
significant. 


RESULTS 


This study included 30 En-DCR procedures in 25 
patients. The demographic and operational character- 
istics of the patients are presented in Table 1. Sili- 
cone tubing was used in 14 cases (46.7%), and no 
stenting device was used in 16 cases (53.3%). Nine 
of 14 cases with silicone intubation (64.3%) and 12 
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TABLE 1. DESCRIPTIVE CHARACTERISTICS 


TABLE 3. ENDOSCOPIC FINDINGS 











With Without 
Silicone Silicone All Cases p 
Sex 4337 
Male 3 (21.4%) 2(125%) 5 (20.0%*) 
Female 11 (78.6%) 14 (87.5%) 25 (80.0%*) 
Age (y) 1364 
Mean + SD 45.7+13.8 40.2+7.3 43.0+11.3 
Range 31-66 23-49 23-66 
Follow-up (mo) 001} 
Mean + SD 14.5 10.3 30.4 13.7 23.1 +14.5 
Range 4-47 5-51 4-51 
Laterality 4507 
Right 10 (71.4%) 10 (62.5%) 20 (66.7%) 
Left 4 (28.6%) 6(37.5%) 10 (33.3%) 
Additional procedure 7238 


Septoplasty 5 (35.7%) 
Septoplasty + 


9 (56.3%) 14 (46.7%) 


concha 
resection 1 (7.1%) 1 (6.3%) 2 (6.7%) 
Concha 
resection 3 (21.4%) 2(125%) 5 (16.7%) 


*According to number of patients. Rest of data were calculated ac- 
cording to number of operated sides (30). 

+Fisher’s exact test. 

£Student’s t-test. 

§Pearson ¥? test. 





of 16 cases without intubation (75%) included addi- 
tional procedures. The indications for additional pro- 
cedures are shown in Table |. The mean duration of 
follow-up was 15 months (range, 4 to 47 months) 
for patients with silicone tubing, whereas it was 30 
months (range, 5 to 51 months) for patients without 
tubing. The difference in follow-up times was statis- 
tically significant (p = .001). 


__ TABLE 2. RESULTS OF OPERATION 


With Without 
Silicone Silicone 
No. % No. % p 














Subjective evaluation .536* 
Symptom-free 9 643 10 62.5 
Significant improvement 2 14.3 3 18.8 
Slight improvement l TA 
Same 1 Tl 3 18.8 
Worse 1 Tl 

Outcome 567* 
Success 12 85.7. 13 81.3 
Failure 2 43 3 18.8 

Discomfort from tube 
Yes 4 28.6 
No 10 71.4 

Discomfort from debridement 
Yes 
No 16 100 


*Fisher’s exact test. 











With Without 
Silicone Silicone 
No. % No. % p 

Fluorescein flow .942* 
Spontaneous 9 643 10 625 
With pressure 2 143 3 18.8 
No flow 3 21.4 3 18.8 

Synechiae 1837 
Present 3 21.4 7 43.8 
Absent 11 78.6 9 56.3 

Granulation 025+ 
Present 6 42.9 l 6.3 
Absent 8 57.1 15 93.8 

Rhinostomy opening S504 
Visible 9 643 11 68.8 
Invisible 5: 35.7 > 31,3 


*Pearson y? test. 
+Fisher’s exact test. 





The outcomes of subjective assessment of the sur- 
gery are shown in Table 2. Primary surgery was suc- 
cessful in 12 cases with silicone tubing (85.7%). One 
of the 2 patients who underwent revision surgery im- 
proved. Thus, the overall success rate after all surger- 
ies was 92.9% in the silicone intubation group. The 
primary success rate was 81.3% (13/16) for patients 
without tubing. Only | patient underwent revision sur- 
gery in this group. The secondary success rate after 
all surgeries was 87.5% (14/16) for patients without 
tubing. There was no statistically significant differ- 
ence between the surgical outcomes of the 2 surgery 
groups on the basis of the subjective evaluation. 


The data of the objective assessment are presented 
in Table 3. Six of the patients with silicone tubing 
(42.9%) had some form of granulation tissue at the 
rhinostomy opening (Fig 2), whereas | of the pa- 
tients without tubing (6.3%) had granulation tissue. 
There was a statistically significant difference be- 
tween the 2 groups regarding the presence of gran- 
ulation tissue (p = .025). Seven of the patients with- 
out tubing (43.8%) and 3 of the patients with tubing 
(21.4%) had some formation of synechiae between 
the nasal septum and the lateral nasal wall. 


The patency of the rhinostomy openings as as- 
sessed by fluorescein is presented in Table 4. One 
patient from each surgery group failed to show evi- 
dence of fluorescein flow on endoscopy, but neither 
had subjective complaints. 


The only complications were in 2 patients who had 
ecchymosis around the medial canthal area and | pa- 
tient who had prolapse of the silicone tube. The pro- 
lapse was resolved by pulling the tube to the nasal 
cavity by intranasal endoscopy. 
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Fig 2. Granulation tissue (asterisk) at intranasal lacrimal 
opening after removal of silicone tube in eighth postop- 
erative week (right side). 


DISCUSSION 


Dacryocystorhinostomy is a commonly performed 
operation in which a fistulous tract is created between 
the lacrimal sac and the nasal cavity in order to alle- 
viate the epiphora due to nasolacrimal duct obstruc- 
tion. Patency of this fistula is claimed to be respon- 
sible for the surgical success.?:!° Silicone tubing has 
been proposed to maintain the patency of the fistula 
by impeding fibrous closure during the postoperative 
healing period.*-> Silicone intubation is a commonly 
suggested procedure in Ex-DCR, especially in cases 
of previous acute dacryocystitis, a small lacrimal sac, 
canalicular disease, or poor mucosal flap formation, 
or because of surgeon preference.!! It is also recom- 
mended for all En-DCR procedures, because the sur- 
gical ostium created during En-DCR heals with gran- 
ulation, as it is not technically possible to create an 
epithelium-lined fistula. It also has been shown that 
patients who adequately retained silicone tubes per- 
formed better than patients who failed to retain sili- 
cone tubes after primary En-DCR.® 


The primary success rates in the present study were 
81.2% in the group without silicone intubation and 
85.7% in the group with intubation. There was not a 
significant difference between these success rates. The 
reported success rates of En-DCR vary between 82% 
and 95%4-57.8 and thus are similar to our success 
rates. The high success rate in patients without intu- 
bation in this study raises the question of the neces- 
sity of intubation in En-DCR. There is a lack of com- 
parative studies of the surgical results of En-DCR 
with and without silicone tubing. However, our find- 


TABLE 4. EVALUATION OF RHINOSTOMY PATENCY 
BY FLUORESCEIN FLOW 


_ With Silicone 








Without Silicone 


Success Failure Success Failure 

Fluorescein Flow No. % No. % No. % No. % 
Spontaneous 9 75.0 0 0 10 76.9 0 0 
With pressure 2 16.7 0 O> 2) W547 - 33.3 
No flow t- 83 2. 100% 1 - 7.72: 66.7 


ings indicate that creation of a patent ostium with ade- 
quate epithelialization is possible without the use of 
tubing in patients with normal or dilated lacrimal sacs 
who undergo En-DCR. It can be stated that intuba- 
tion might not improve the surgical results signifi- 
cantly. 


Walland and Rose!* cited Allen and Berlin as re- 
porting that silicone intubation in Ex-DCR may ac- 
tually increase the risk of failure. However, Walland 
and Rose have shown that intubation does not in- 
crease the risk of failure and, with appropriate case 
selection, may increase the success rate of Ex-DCR 
in patients with canalicular disease. Mannor and Mil- 
man!® reported that patients with a normal or dilated 
lacrimal sac had a success rate of 82%, whereas those 
with a scarred sac had only a 29% likelihood of suc- 
cess with En-DCR. We have not operated on patients 
with scarred sacs, and the results of our study do not 
indicate the elimination of the usage of silicone in- 
tubation in such patients. 


Apart from proper patient selection, several addi- 
tional factors, such as the diameter and location of 
the rhinostomy opening and synechia formation be- 
tween the rhinostomy site and intranasal structures, 
might have an influence on the surgical outcome.?* 
Although the number of additional surgical proce- 
dures was higher in the group without intubation, 
there was no statistically significant difference regard- 
ing synechia formation between the 2 groups. Hence, 
it can be assumed that synechiae had no significant 
influence on surgical success. 


Besides their advantages, the silicone tubes have 
some disadvantages that exert an influence on the 
outcome of surgery. First, they are not free of com- 
plications. They may cause granulation, chronic in- 
fection, or canalicular laceration, and they may be 
dislodged. !!.!4.!7-19 Kong et al!® showed that granu- 
lation tissue was present after laser En-DCR with 
silicone intubation in 50% of patients at 8 weeks af- 
ter operation. Similarly, we found that 6 of the pa- 
tients in the intubation group (42.9%) had some form 
of granulation tissue; in contrast, only | patient with- 
out intubation (6.3%) had granulation tissue. The 
granulation tissue extended over the silicone tubing 
to the nasal cavity as a vascular fleshy mass obscur- 
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ing the visibility of the rhinostomy opening. In the 
majority of the cases, the granulation tissue appeared 
around 3 weeks after the operation. In most cases, 
the granulation tissue disappeared with simple exci- 
sion and topical steroids. However, we have not ob- 
served a detrimental effect of granulation on the suc- 
cess of En-DCR. 


Silicone tubing seemed to provoke the formation 
of granulation tissue by acting as a foreign material. 
Weber et al?° explained 3 possible mechanisms lead- 
ing to granulation tissue and scar formation at the 
nasofrontal duct. First is through persistent block- 
age of the duct with blood and fibrin in the early 
postoperative period. Second is through pronounced 
swelling beginning in the third postoperative week, 
leading to zones of contact with adjacent or opposed 
areas of the nasofrontal duct. Third is through reori- 
entation of collagen fibers in the remodeling phase 
beginning in the third postoperative week. This last 
supports our observations, in which the granulation 
tissue appeared 3 weeks after the operation. 


There is no definitive agreement among authors 
as to how long the tubes must remain in place.?° Al- 
though many surgeons remove the tubes after 8 weeks, 
some have recommended keeping them in place for 
6 months.?° 


Tube prolapse has been reported to be one of the 
most frequent complications of En-DCR operations 
with silicone intubation.!? The tube may also undergo 
superior displacement and be cut by the uninitiated 
surgeon. We encountered this complication in | of 
our patients. Our routine of securing the tubes with 
2 square knots and tying with 4-0 silk seems to be 
adequate to prevent tube prolapse. 


Four of our patients (28.6%) reported discomfort 
from the tube. Moreover, intubation increases the cost 
of treatment in En-DCR and Ex-DCR operations. 
Given these negative findings, we do not advocate 
the use of intubation in En-DCR operations in pa- 
tients with normal or dilated sacs. 


As stated above, the surgical outcome is thought 
to be related to the patency of the rhinostomy open- 
ing. It has been believed that the patency is highly 
related to the size of the surgically created fistula. 
Nevertheless, the size of the epithelialized ostium 
necessary for patency is controversial. Linberg et al!° 


have shown that in a series of patients who under- 
went Ex-DCR, the average diameter of the functional 
Ex-DCR fistula after wound healing was 1.80 mm. 
Metson? pointed out that the exact location of the 
smallest opening in the lateral nasal wall was often 
not apparent until fluorescein dye was instilled in 
the eye. Bumsted et al? showed that the area of the 
healed intranasal ostium was approximately 2% of 
the area of the surgical anastomosis. In addition, they 
were not able to show any correlation between the 
size of the surgical anastomosis and the healed nasal 
ostium in patients who underwent Ex-DCR. These 
studies suggest that the presence of a surgically cre- 
ated epithelium-lined fistula as in Ex-DCR does not 
prevent contracture of the wound. The success of the 
surgery seems not to be directly related to the size of 
the healed ostium. It has been shown that even the 
smallest ostium, once healed and patent, has contin- 
ued to provide good lacrimal drainage.? The find- 
ings of our study corroborate these previous observa- 
tions. Although the rhinostomy opening was visible 
in 11 patients without intubation (68.8%) and 9 pa- 
tients with intubation (64.3%), 85.7% of the patients 
in the intubation group and 81.2% in the no-intuba- 
tion group declared clinical improvement in their 
symptoms. 


The natural wound healing process — namely, the 
hemostasis, inflammation, proliferation, and remod- 
eling that take place in a sequential manner — has 
an important influence on the patency of the rhinos- 
tomy opening.!° This normal process is reported to 
be responsible for En-DCR and Ex-DCR failures by 
causing 1) primary ostium closure, 2) formation of a 
cicatrix with the middle turbinate, 3) formation of a 
cicatrix with the nasal septum, and 4) formation of a 
granuloma within the ostium.® These findings indi- 
cate that the key to a successful En-DCR operation 
is the adequate epithelialization of the ostium cre- 
ated by surgery without any synechia formation. 


In conclusion, considering that the surgical results 
of En-DCR with and without silicone intubation are 
almost equal, and considering the risks of granula- 
tion formation and tube prolapse, together with pa- 
tient discomfort from the tube material and the addi- 
tional cost, we recommend that En-DCR without sili- 
cone intubation be the treatment of choice in cases 
of chronic epiphora due to postsaccal stenosis of the 
lacrimal canal. 
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WASHOUT OF 133-XENON AS AN OBJECTIVE ASSESSMENT OF 
PARANASAL SINUS VENTILATION IN ENDOSCOPIC SINUS SURGERY 


BO PAULSSON, MD 
HELSINGBORG, SWEDEN 
SVEN LINDBERG, MD, PHD 
LUND, SWEDEN 


PER OHLIN, MD, PHD 


HELSINGBORG, SWEDEN 


Endoscopic sinus surgery (ESS) is today a common method for the treatment of chronic rhinosinusitis. Assessment of the results 
has been based mainly upon subjective evaluation, and only a few reports present objective measurements. In the present study, the 
133-xenon washout technique was used for preoperative and postoperative evaluation of paranasal sinus ventilation in 12 patients 
selected for ESS. The postoperative half-times (T!/2) of 133-xenon washout were lower in the sinuses with abnormal preoperative 
half-times (T!/2), especially in the maxillary sinuses, where the postoperative T!/2 was 44 (22 to 150) minutes (median and quartiles, 
Q1-Q3) as compared with a preoperative T!/2 of 202 (94 to 278) minutes. The postoperative evaluation included a questionnaire and 
a follow-up visit with endoscopy and measurements of nasal nitric oxide. The results showed that patients who declared a marked 
reduction in symptoms exhibited significantly improved sinus ventilation. However, no direct correlation was found between 
improvement in ventilation and symptom improvement. Nine of the 12 patients showed improvement on endoscopy, and these 
patients also exhibited improved sinus ventilation. The postoperative nasal nitric oxide levels were within the normal range in 11 of 
the 12 patients; the other patient showed pathological T!/2 values for all paranasal sinuses. The 133-xenon washout technique is thus 
a method that can be used for objective evaluation of the ventilation of the paranasal sinuses before and after ESS procedures. 
However, the technique cannot be used to evaluate sinuses with totally obstructed ostia or postoperative sinuses with very wide neo- 
ostia, as rapid washout may lead to no activity remaining at the time of measurement. 


KEY WORDS — endoscopic sinus surgery, nitric oxide, objective results, paranasal sinus ventilation, questionnaire, xenon. 


INTRODUCTION 


The concept of endoscopic sinus surgery (ESS) is 
to remove disease in the area of the ostiomeatal com- 
plex, to improve ventilation and mucociliary clear- 
ance through the natural ostia of the paranasal si- 
nuses.! Although ESS is a common procedure with 
generally accepted surgical principles, there is cer- 
tainly a need for follow-up and evaluation of the re- 
sults. Most reports have dealt with patients’ subjective 
symptoms, and reports presenting methods for objec- 
tive evaluation are rare.*-4 Radiologic methods, such 
as preoperative and postoperative computed tomog- 
raphy (CT) scans, have, however, been used.° To the 
best of our knowledge, the most thorough report of 
objective assessment included endoscopic findings 
made before and after operation, nasal inspiratory 
peak flow testing, anterior rhinomanometry, acoustic 
rhinometry, tests of olfaction, and tests of mucociliary 
function.® Of these, olfaction and mucociliary clear- 
ance improved significantly. The sinus ventilation 
was not measured, however. Thus, there is a need 
for measurements of ventilation of the paranasal si- 
nuses in patients with chronic sinus disease, before 
and after ESS. Noninvasive methods for the measure- 


ment of ventilation of the paranasal sinuses with 133- 
xenon have previously been developed for use in 
healthy subjects’ and in patients with chronic sinus- 
itis.? An air—133-xenon mixture is insufflated into 
the nasal cavities, passing through the ostia of the 
sinuses. The washout of the 133-xenon from the para- 
nasal sinuses is then followed with a scintillation 
camera, and the washout times are adopted as a mea- 
sure of sinus ventilation. 


The aim of the present study was to use the 133- 
xenon washout method for preoperative and postop- 
erative assessment of paranasal sinus ventilation in 
patients selected for ESS. Furthermore, our intention 
was to compare the results of the 133-xenon washout 
method with the type of endoscopic surgery per- 
formed, the results of a postoperative questionnaire, 
and an endoscopic evaluation. The postoperative as- 
sessment also included measurements of nasal nitric 
oxide (NO), as low nasal NO levels have been shown 
to correlate with sinus disease. !0 


SUBJECTS AND METHODS 


Twelve patients (8 men and 4 women; mean age, 
31 years; range, 24 to 35 years) with chronic sinus 
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TABLE 1. DIAGNOSIS AND ENDOSCOPIC SINUS SURGERY PERFORMED 




















Middle Clearance Reduction 
Meatal Anterior of Frontal of Middle 
Uncinectomy Antrostomy Ethmoidectomy Recess Turbinate 
Patient Diagnosis R L R L R L R L R L 
1 CS S S S S S S 
2 cs S S 
3 CS S S S S 
4 CS S S S 
5 CS S* 
6 cs S S S S S S S 
7 CS S S S 
8 P S S S S S S 
9 P S S S S S S 
10 P S S 
11 P S S S 
12 P S S S S S S 


R —- right; L — left; CS — chronic sinusitis; S — surgery; P — polyposis. 


*Previous endoscopic surgery with uncinectomy, middle meatal antrostomy, and clearance of frontal recess on left side, but recurrent frontal 
sinusitis. Repeated surgery with revision of frontal recess combined with transfrontal approach and removal of interfrontal septum. 





disease were included in the study. They were all 
scheduled for ESS at the decision of an otolaryngol- 
ogy specialist, on the basis of medical history, endo- 
scopic findings, and results of a CT scan of the para- 
nasal sinuses. One of the patients had had previous 
sinus surgery but now presented with recurrent dis- 
ease. Seven of the patients had chronic sinusitis, and 
5 suffered from nasal polyposis according to preop- 
erative endoscopic findings. The results of the CT 
scans and preoperative ventilation of these patients 
were presented previously by Paulsson et al.? 


The 133-xenon washout method’ was used to eval- 
uate the ventilation of the paranasal sinuses in the 
12 patients before and after sinus surgery. It should 
be noted that this technique measures washout from 
main ostia, as well as accessory ostia. In brief, about 
200 MBq of an air—133-xenon mixture is insufflated 
into the nasal cavities. After insufflation, the passage 
of the air—133-xenon mixture into the paranasal si- 
nuses is facilitated by increasing the pressure in the 
nasal cavity. This is done with a rubber balloon, de- 
veloped for the treatment of serous otitis media by 
increasing intranasal pressure (Otovent, Viborg, Den- 
mark). The balloon was connected to the nostril con- 
tralateral to the nostril used for insufflation. To ac- 
custom the patients to the insufflation procedure, we 
instructed them to inflate the balloon twice on both 
sides before xenon insufflation. For insufflation, 
about 200 MBq of 133-xenon gas was mixed with 
air in a 60-mL syringe. Fifteen milliliters of this mix- 
ture was insufflated during 10 seconds in one nostril 
through an olive-tipped catheter while the patient held 
his or her breath. Immediately after insufflation, with 
the catheter still connected, the patient inflated the 
balloon and kept it inflated for 15 to 20 seconds be- 


fore starting normal breathing through the mouth. 
The procedure was then repeated in the contralateral 
nasal cavity. The insufflation procedure was per- 
formed once more in each nostril. After the insuffla- 
tion procedure, the nose was clamped until the mea- 
surements were initiated. The washout of the 133- 
xenon from the paranasal sinuses was measured with 
a scintillation camera for dynamic single-photon- 
emission CT (SPECT; Toshiba GCA-901 A/ECT-S4, 
Tokyo, Japan). The SPECT camera head executed an 
anterior rotation of 180° in 1 minute, with 12° sam- 
pling angles, the rotation being repeated automatical- 
ly every minute for 30 minutes. Three-dimensional 
regions of interest (ROIs) were selected from the ac- 
tivity images obtained for the frontal, maxillary, pos- 
terior ethmoidal, and sphenoidal sinuses. The number 
of counts per minute for each ROI was calculated 
and monitored for 30 minutes. These values were then 
plotted to give 133-xenon washout curves. Exponen- 
tial curve fitting was applied with the software sup- 
plied with the Toshiba camera. The half-time (T!/2) 
of the fitted curve was adopted as the T!/2 of the 133- 
xenon washout. 


In a previous study on healthy subjects,? a T!/2 of 
133-xenon washout within 54 minutes for maxillary 
sinuses, 26 minutes for frontal sinuses, and 64 min- 
utes for posterior ethmoidal and sphenoidal sinuses 
were considered normal. These figures were based 
on the assumption that 95% of healthy subjects dis- 
play a T!/2 within these values (mean + 2 SD), anda 
T'/2 above these limits is considered as possibly 
pathological. These values were used in the present 
study to differentiate between a normal T!/2 and a 
pathological T!/2. 


Postoperative measurements of ventilation were 
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TABLE 2. QUESTIONNAIRE SENT TO PATIENTS 

BEFORE FOLLOW-UP VISIT 

. Is/was your nose blocked? 

. Does/did your nose run? 

. Do/did you get headaches? 

. Do/did you feel dripping at the back of the nose? 

. Do/did you get pains in the face or eyes? 

. Do/did you suffer from a bad smell in the nose? 

. Is/was your sense of smell reduced? 

. Do/did you have recurrent acute sinusitis? 

. Do you feel improved, unchanged or worse after surgery? 


. Would you recommend endoscopic surgery to other pa- 
tients? 

Symptoms before surgery and 1 year after surgery should be graded 

as 0 = none, 1 = mild, 2 = moderate, and 3 = severe in response to 

questions 1-8 (maximum score = 24) Responses to questions 9-10 

are not included in the analysis of the symptom score. 








WOON Nr Ph WON 


= 
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performed after 6 + 4 months tmean + SD). The ESS 
was performed by 2 otolaryngology surgeons who 
performed surgery on 6 patients each. The type of 
surgery performed is given in Table 1. In none of the 
patients was posterior ethmoidzctomy or sphenoidec- 
tomy performed. 


Clinical follow-up of the 12 patients was per- 
formed 5 +2 years (mean + SD) after surgery. A ques- 
tionnaire was sent to the patients before a visit to the 
otolaryngology department. The questionnaire was 
designed according to Fairley et al!! and included 
12 questions, each graded in 4 steps from “no symp- 
toms” to “severe symptoms.” The total sum of the 
scores was used as a measure cf the outcome of ESS. 
Eight of the original Fairley questions were used in 
our study (Table 2). These 8 questions represented 
the most common symptoms found in a study of 113 


patients with chronic sinus disease. 12 In our question- 
naire, the patients were asked about their symptoms 
before surgery, 1 year after surgery, and at the time 
of the follow-up visit. The patients were also asked 
whether their condition was improved, unchanged, 
or worse after surgery, and whether they would rec- 
ommend ESS to other patients with chronic rhino- 
sinusitis. These 2 additional questions were not in- 
cluded in the symptom score calculation. At the clin- 
ical visit, the answers to the questions were discussed 
with the patient by an otolaryngology specialist. The 
follow-up visit included a clinical examination with 
nasal endoscopy performed by an independent otolar- 
yngology specialist well experienced in ESS proce- 
dures. His judgment of the results of the sinus surgery 
was summarized in terms of a positive effect, no ef- 


fect, or a negative effect. 


Measurements of nitric oxide (NO) in the nasal 
cavities were performed with an NO analyzer (CLD 
700 AL med, Eco Physics AG, Durnten, Switzerland) 
in all patients. In brief, this analyzer works by mix- 
ing NO with ozone, producing an activated NOx mole- 
cule, which in turn produces a light quantum. The light 
intensity is proportional to the concentration of NO 
in the analyzed gas mixture. A nasal olive was intro- 
duced gently into the vestibulum nasi, without con- 
tact with the mucosa, and connected to the NO ana- 
lyzer. The subject was instructed to breathe through 
his or her mouth with the contralateral nostril open 
during measurements. On the basis of measurements 
in healthy subjects, NO values below 100 parts per 
billion (ppb) are considered pathological.!° A low NO 
concentration has been shown to correlate with dys- 
function of the mucociliary system,!? and low nasal 


TABLE 3. PREOPERATIVE AND POSTOPERATIVE HALF-TIMES OF 133-XENON WASHOUT IN MINUTES 














Right Maxillary Left Maxillary Ethmoidal and 
Frontal Sinus Sinus Sinus Sphenoidal Sinuses 
Patient Diagnosis Preop Postop Preop Postop Preop Postop Preop Postop 
1 cS NA 44 23 NA 3 NA 4 12 
2 cS 10 10 6 6 7 6 NA NA 
3 CS 10 6 5 3 2 1 NA NA 
4 cs 23 52 il 17 215 I 6 10 
5 CS 100 28 29 71 7 73 NA NA 
6 CS NA NA NA NA NA NA 89 49 
7 cs 4 5 2 0.5 7 1 5 2 
8 P 40 118 86 44 278 230 152 108 
9 P 307 32 314 NA 23 NA 109 45 
10 P NA 88 185 70 17 14 NA NA 
li P 91 79 NA NA NA NA 107 43 
12 P 325 43 202 42 94 NA NA 416 
Mean + SD 1014127 46436 8624110 32 £30 65 + 100 47 £85 67 +61 86 + 138 


Median (Q1-Q3) 


Difference in T!/2 p = .527 (n=9) 


40 (10-204) 43 (10-79) 26(6-189) 30(4-64) 12(6-124) 6(1-73) 89(5-109) 44 (11-93) 


p=.102 (n= 15) p= .126(n=7) 


Data are shown in bold and italic for sinuses for which endoscopic sinus surgery would have effect on ventilation. 
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Fig 1. Half-times of 133-xenon washout in sinuses 
before and after endoscopic sinus surgery. Dashed T 
line indicates upper 95% limit of confidence in- € 
terval (mean + 2 SD) of 133-xenon washout in r 
corresponding sinuses of healthy subjects. Arrows g 
indicate values of sinuses affected by endoscopic z 
sinus surgery. A) Frontal sinus. B) Maxillary si- 
nuses. C) Posterior ethmoidal and sphenoidal si- 
nuses. 
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with chronic sinus disease. 1° 





NO concentrations are frequently found in patients 


Statistics. All results are expressed as medians and 
quartiles (Q1-Q3), unless otherwise stated. We con- 
sidered p values of less than .05 to be significant. 
For intraindividual comparisons of preoperative and 
postoperative T!/2 values and preoperative and post- 
operative symptom scores, the Wilcoxon signed rank 
test was used. For interindividual comparisons, 
Mann-Whitney U-test was performed. Spearman’s 
rank correlation analysis (correlation coefficient giv- 
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en as p) was performed for correlation of the reduc- 
tion in T!/2 (AT!/2) to postoperative symptom scores. 
All statistical analysis was performed with SPSS soft- 
ware (SPSS, Chicago, Ilinois). 


RESULTS 


Relationship Between Postoperative T!/2 and Pre- 
operative T!/2. The preoperative and postoperative 
T!/2 of 133-xenon washout for different paranasal 
sinuses are given in Table 3 and Fig 1. For all para- 
nasal sinuses, the preoperative T!/2 was 37 (6 to 70) 


the 


714 Paulsson et al, 133-Xenon Washout for Objective Assessment in Sinus Surgery 


TABLE 4. COMPARISON OF PREOPERATIVE AND 
POSTOPERATIVE HALF-TIMES (T!/2) OF 133-XENON 
WASHOUT FROM PARANASAL SINUSES OPERATED 

ON, BY WILCOXON SIGNED RANK TEST 


TABLE 5. COMPARISON OF DIFFERENCES IN HALF- 
TIME (AT!/2) OF 133-XENON WASHOUT FROM 
PARANASAL SINUSES OPERATED ON, BY 
MANN-WHITNEY U-TEST 








No. of 
Pts PreopT!/2 PostopT!/2 p(n) 





Subjective improvement 


Slight 3 9 11 593 
(5-69) (7-40) (4) 

Moderate 3 185 42 .043 
(12-264) (8-57) (5) 

Marked 6 104 44 .023 


(52-199) (9-101) (12) 
Endoscopic evaluation 


Improved 9 105 37 007 
(10-212) (5-64) (16) 

Unchanged 3 89 43 .068 
(9-99) (8-64) (6) 


Data are median (and quartiles in parentheses) in minutes. “Subjec- 
tive improvement” refers to questionnaire. n — number of sinuses. 





minutes and the postoperative T!/2 was 23 (6 to 109) 
minutes. This decrease is significant (p = .020; n = 
31). When the results were divided according to fron- 
tal, maxillary, posterior ethmoidal, and sphenoidal 
sinuses, the postoperative values were not signifi- 
cantly lower than the preoperative values (Table 3). 
However, for sinuses with a pathological preopera- 
tive T!/2, the postoperative value was 45 (40 to 86) 
minutes as compared with the preoperative value of 
131 (92 to 262) minutes, and this difference was sig- 
nificant (p = .006; n = 14). The change in T!/2 was 
particularly marked in the maxillary sinuses with a 
pathological preoperative T!/2: from 202 (94 to 278) 
minutes to 44 (22 to 150) minutes (p = .043; n = 5). 
In contrast, the sinuses with a normal preoperative 
T'/2 of 7 (4 to 16) minutes were not significantly 
influenced by surgery (postoperative T!/2, 6 [3 to 16] 
minutes; p = .378; n = 17). 


Relationship Between T'/2 and Surgery Performed. 





No. of Pts AT!/2 

Subjective improvement 

Slight 3 1 (-6-30) 

Moderate 3 115 (4-221) 

Marked 6 46 (5-70) 
Endoscopic evaluation 

Improved 9 46 (2-149) 

Unchanged 3 12 (1-52) 


Data are median (and quartiles in parentheses) in minutes. For “slight” 
versus “moderate,” p = .063 (n = 9 sinuses). For “slight” versus 
“marked,” p = .042 (16 sinuses). For “moderate” versus “marked,” p 
= .442 (17 sinuses). For “improved” versus “unchanged,” p = .275 
(21 sinuses). 





The extent of the ESS procedure determines which 
of the paranasal sinuses are influenced by surgery 
(see arrows indicating the paranasal sinuses affected 
by surgery in Fig 1). The postoperative T!/2 values, 
42 (8 to 60) minutes, were significantly (p = .002; n 
= 21) lower than the preoperative values, 91 (9 to 
194) minutes, in the sinuses affected by surgery. The 
reduction in T!/2 (AT!/2) after surgery was 42 (2 to 
94) minutes. 

Relationship Between T'/2 and Diagnosis. In pa- 
tients with chronic sinusitis, the preoperative T!/2 for 
the paranasal sinuses was 7 (5 to 23) minutes, in con- 
trast to 109 (86 to 278) minutes in patients with poly- 
posis. This difference is highly significant (p < .001; 
n= 36). The T'/2 was significantly reduced, to 56 (20 
to 149) minutes after surgery, for the patients with 
polyposis (p = .015; n = 12). In the patients with 
chronic sinusitis, the corresponding reduction was 1 
(—6 to 4) minute, and it was not significant (p = .813; 
n= 19). 

Relationship Between T!/2 and Results of Ques- 
tionnaire. The preoperative and postoperative symp- 
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Fig 3. Preoperative and postoperative half-times 
of 133-xenon washout for sinuses in patients who 
were improved and patients who were unchanged 
according to endoscopic evaluation. 
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tom scores on the questionnaire provided another way 
of categorizing the patients. The postoperative symp- 
tom score of 5.5 (0.25 to 12) and the current symp- 
tom score of 2.5 (0 to 7.5) were significantly lower 
than the preoperative score of 15.5 (12.4 to 21.5; p= 
.003, n = 12; and p = .002, n = 12). The symptom 
score at the follow-up visit did not differ from the 
postoperative symptom score (p = .262; n = 12). On 
the basis of the postoperative symptom score, the 
patients could be divided into 3 groups: those with 
scores between 0 and 5 (marked improvement), those 
with scores between 6 and 10 (moderate improve- 
ment), and those with scores between 11 and 24 
(slight improvement). The postoperative T!/2 for the 
sinuses affected by surgery was significantly reduced 
in patients with moderate or marked improvement 
(Table 4 and Fig 2). Moreover, the AT!/2 was signif- 
icantly smaller in patients with slight improvement 
than in those with marked improvement (p = .042; n 
= 16; Table 5). However, no direct correlation was 
found between the AT'/2 and the postoperative symp- 
tom score (p = 0.294; p = .195; n = 21) or between 
the change in symptom score and the AT!/2 (p = 
—0.171; p = .457; n = 20). 

According to the answers to the additional ques- 
tions on the questionnaire, 11 of the 12 patients felt 
subjectively improved by surgery and only 1 felt un- 
changed (patient 6 in Tables 1 and 3, with a postopera- 
tive symptom score of 16). All but 1 of the patients 
(patient 6 in Tables 1 and 3) would recommend ESS 
to other patients with chronic sinus disease. 

Relationship Between T!/2 and Endoscopic Eval- 
uation. According to endoscopic evaluation, 9 pa- 
tients were considered improved and 3 patients un- 
changed. However, there was no significant differ- 
ence in postoperative symptom score between the 
patients who were improved (3; 0 to 9) and those 
who were unchanged (13; 1 to 16) according to en- 
doscopy (p = .209; n = 12). Nevertheless, the values 
of the postoperative T!/2 were significantly lower 





a ee | anae el 2 eal mood 





Improved Unchanged 


than the preoperative values for the sinuses of the 
endoscopically improved patients, and not for the un- 
changed patients (Table 4 and Fig 3). One of the 3 
endoscopically unchanged patients was the subjec- 
tively unchanged patient (patient 6). 


Relationship Between T'/2 and Nitric Oxide Val- 
ues. The NO values at the follow-up visit were 218 
+ 124 ppb (mean + SD) for the right nasal cavity and 
228 + 117 ppb for the left nasal cavity. Only 1 of the 
12 patients had NO values below the normal limit 
(100 ppb): 18 ppb for the right nasal cavity and 70 
ppb for the left nasal cavity. This patient had a post- 
operative symptom score of 8 and pathological post- 
operative T!/2 values for all paranasal sinuses (pa- 
tient 8; Table 3). 


DISCUSSION 


The 133-xenon washout measurements showed 
that the ventilation of the paranasal sinuses was im- 
proved after ESS, especially in sinuses in which the 
preoperative washout T!/2 was abnormal. These re- 
sults indicate that the change in the T!/2 of 133-xenon 
washout after sinus surgery provides a useful objec- 
tive measure of improvement in paranasal sinus ven- 
tilation. However, the interpretation of a finding of 
no activity in a sinus constitutes a problem. In the 
present study, a relatively high number of sinuses 
displayed no activity (Table 3), in contrast to healthy 
sinuses.? In preoperative measurements, this finding 
may indicate a lack of ventilation of the sinus, but in 
the postoperative measurement, it could also reflect 
very rapid ventilation due to a wide neo-ostium 
created by the surgeon. This phenomenon underlines 
the importance of including other assessments, such 
as CT and endoscopy, with measurements of xenon 
washout. Further, the presence of accessory ostia 
could possibly give a low T!/2 for maxillary sinuses, 
indicating normal ostial function even in cases of ob- 
struction of the main ostium that would lead to insuf- 
ficient mucociliary clearance. 
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In the present study, imprevement in ventilation 
of the paranasal sinuses was found mainly in patients 
with nasal polyposis. We found no significant im- 
provement in ventilation in the sinuses of the patients 
treated for chronic sinusitis. This finding is probably 
due to the high number of sinuses with preoperative 
T!/2 values within normal limits in this group. This 
finding, that some of these patients had normal pre- 
operative T!/2 values, is interesting and raises the 
question of whether these pat-ents were too healthy 
to undergo ESS. However, on2 should keep in mind 
that poor ventilation is only one component of the 
pathophysiology of chronic sinus disease, and other 
factors, such as mucociliary clearance, were not eval- 
uated in our study. 


At the follow-up examination, a subjective im- 
provement was found in more than 90% of the pa- 
tients in the present study. This is in accordance with 
the results of other studies. In the patients exhibit- 
ing moderate or marked reductions of symptoms, si- 
nus ventilation was significantly improved. However, 
we could not find any direct correlation between a 
reduction in symptoms and :mproved ventilation 
(AT!/2) of the paranasal sinuses. A possible explana- 
tion may be the relatively small number of patients 
in our study. Sinus ventilation was significantly im- 
proved in the 9 patients in whom postoperative endo- 
scopic evaluation also showed improvement. Al- 
though there was a significant difference in time be- 
tween the subjective evaluation and the postopera- 
tive measurements of 133-xenon washout, which may 
have influenced the interpretation of the responses 
to the questionnaire, there was no significant differ- 
ence between the symptom scores at the follow-up 
visit and the symptom scores 1 year after operation. 
Thus, it seems that the bias introduced is of little im- 
portance for the conclusions in the present study. 


The majority of our patients were asymptomatic, 
or at least had few symptoms, at the follow-up exam- 
ination. Most displayed normal postoperative T!/2 
levels, indicating normal sinus ventilation. Normal 


sinus function after surgery was further demonstrated 
by the finding that 11 of the patients had normal na- 
sal NO levels. Patients with symptomatic chronic 
sinusitis have been shown to have subnormal NO 
levels.!® It has been proposed that the paranasal si- 
nuses produce, and may also act as reservoirs for, 
NO.'4 In this context, the observation that the only 
patient with subnormal nasal NO levels also had a 
pathological postoperative T!/2 is interesting. 


The present study shows connections among re- 
duction in symptoms, endoscopic resolution of dis- 
ease, and improved ventilation after ESS. Kennedy 
et al? used endoscopic evaluation as an objective 
measure of the outcome of ESS, but did not find any 
correlation between endoscopic resolution of muco- 
sal abnormalities and symptom improvement. How- 
ever, one may argue that endoscopic evaluation can- 
not be considered a true objective measure, as there 
may be differences in the evaluations of the endos- 
copists and some bias may be introduced. True objec- 
tive measures were studied by Lund and Scadding,® 
who used ciliary beat frequency as an objective mea- 
sure of mucociliary function. They found a signifi- 
cant postoperative reduction in symptoms and im- 
provement in mucociliary function and olfaction. 
However, correlations between symptoms and meas- 
urements were not studied. In a study by Asai et al,}5 
a correlation was observed between the results of a 
saccharin test and the appearance of the sinus mu- 
cosa after ESS. Mucociliary function tests and the 
present 133-xenon washout method complement 
each other in the objective evaluation of the patho- 
physiology of the paranasal sinuses before and after 
ESS. 


The correlations between improvement in symp- 
toms, endoscopic improvement, and ventilation of 
the sinuses operated on indicate that the 133-xenon 
washout technique is appropriate for postoperative _ 
evaluation of the results of ESS. The use of the meth- 
od before operation, for prediction of the outcome 
of surgery, requires further study. 
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CONTRIBUTION OF SPEECH RATE TO SPEECH PERCEPTION IN 
MULTICHANNEL COCHLEAR IMPLANT USERS 
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MITSUYOSHI NAGURA, MD 


SHUJI OCHO, MD 
TOMOYUKI HOSHINO, MD 
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This study describes the effect of speech rate (fast, 11 syllables per second: medium, 9 syllables per second; slow, 6 syllables per 
second) on speech perception in 10 cochlear implant users. The speech perception performance was evaluated on the basis of the 
percentage score of syllables that were correctly recalled in sentences composed of 4 to 6 words. The percentage scores at the fast, 
medium, and slow speech rates were 15.7%, 39.0%, and 56.0%, respectively. The effect of speech rate slowing was significant (p < 
.0001). Variations in the effect of speech rate slowing were observed in the cochlear implant users. The improvement of speech 
perception by speech rate slowing was significantly (p < .005) related to the word test score and the score at the fast speech rate. The 


results reveal that the rate of speech is an important factor in improving the speech perception of cochlear implant users. 


KEY WORDS — cochlear implant, speech perception, speech rate. 


INTRODUCTION 


The cochlear implant improves hearing and speech 
perception in postlingually deafened adults. Speech 
perception testing using monosyllables, words, and 
sentences has been performed to evaluate the effects 
of cochlear implant devices and speech coding strat- 
egies,!. 


Advances in cochlear implant technology have in- 
creased the amount of speech processing informa- 
tion. Current cochlear implants encode the entire 
speech signal and present it at high speed. In spite 
of technological improvements, however, the access 
to auditory information is limited. The limited pro- 
cessing ability of cochlear implant systems makes it 
difficult for users to understand rapid speech. The 
slowing of speech has been reported to improve 
speech comprehension for elderly adults.4> The 
speech rate may be also an important factor in speech 
perception performance in cochlear implant users. 
Recent improvements in the speed of digital signal 
processors have enabled time-scale modification of 
speech signals in real time. Nakamura et alf proposed 
a real-time speech rate conversion system that slowed 
speech rates by reducing the duration of pauses and 
increasing the duration of voiced segments, and im- 
proved results in sentence recognition tests were ob- 
tained. Therefore, the purpose of this study was to 
examine the effects of speech rate slowing on speech 
comprehension in cochlear implant users deafened 
postlingually by recall tests of sentences composed 
of 4 to 6 words. 


METHODS 


The subjects were 10 postlingually deafened adults 
(3 men and 7 women) ranging from 41 to 64 years of 
age, with a mean age of 54.2 years, who use the Nu- 
cleus 22-channel cochlear implant system with the 
spectral peak (Speak) coding strategy. All patients 
were native speakers of Japanese. This study was per- 
formed between 23 and 47 months after operation 
(mean, 34 months). The average number of activated 
electrodes was 21.2 (range, 19 to 22), and they were 
programmed in the bipolar plus 1 electrode pairing 
mode in all subjects. Additional background informa- 
tion for all subjects is provided in Table 1. 


Each subject heard a total of 60 sentences recorded 
digitally by a male native speaker of Japanese with 
normal intonation. The length of each sentence, which 
was drawn from materials published by Advanced 
Telecommunications Research Institute International, 
was between 4 and 6 words (average, 5.02). Three 
speech rate variations were created for each sentence: 
fast (11 syllables per second), medium (9 syllables 
per second), and slow (6 syllables per second). The 
normal average speaking rate for Japanese announc- 
ers is roughly 5 to 9 syllables per second.’ Twenty 
sentences were used for each speech rate. The male 
speaker was recorded (Sony ECM-737 Electrovoice 
microphone and Sony TCD-D7 digital audiotape cas- 
sette deck) on digital audiotape while reading these 
sentences at 3 different rates of speech in a sound- 
attenuated room. 


The sentences were amplified (Technics SU-A700) 
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TABLE 1. PATIENT INFORMATION 











Age at Duration of Duration of Cause of No. of Hearing 
Implantation Ear Profound Rehabilitation Profound Activated Level in Ear 
Patient Sex (y) Implanted Deafness (mo) Deafness Electrodes Implanted (dB) 

1 M 55 J% 37 mo 32 Meningitis 22 Scale out 
2 F 50 L 7y 35 Unknown 20 116 
3 M 53 E 20 y 42 Streptomycin 21 108 
4 F 63 R 30 mo 33 Syphilitic 22 101 

labyrinthitis 

5 F 4] R 49 mo 40 Labyrinthitis 21 Scale out 

6 M 61 E 22 mo 35 Labyrinthitis 21 Scale out 
7 F 52 R 6 mo 30 Unknown 22 101 
8 F 58 L; 24 mo 23 Unknown 22 130 

9 F 49 R 24 mo 47 Meningitis 19 Scale out 
10 F 60 R 4y 23 Unknown 22 120 





and then presented to individual subjects over a loud- 
speaker (Panasonic SB-MX30) in a sound-attenuated 
room. Each subject sat about 1 m in front of the loud- 
speaker. The sentences were presented at a comfort- 
able listening level (approximately 56 dB sound pres- 
sure level). The subjects were instructed to listen 
carefully to each sentence and then to recall it as ac- 
curately as possible immediately after the sentence 
was presented. A strict verbatim criterion was used 
for scoring. The sentences were scored according to 
how many syllables were recalled exactly as origi- 
nally presented. Stat View 4.51 software was used 
for statistical analysis. 


RESULTS 


The average score on the speech perception test at 
each speech rate is shown in Fig 1. The average scores 
at the fast, medium, and slow speech rates were 
15.7%, 39.0%, and 56.0%, respectively. The results 
of the analyses revealed a significant effect of speech 
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Fig 1. Percentage of average recall scores for each pat- 
tern of speech rate change: fast, 11 syllables per second; 
medium, 9 syllables per second; and slow, 6 syllables 
per second. Average scores were 15.7%, 39.0%, and 
56.0%, respectively. These results revealed significant 
effect of speech rate slowing. Asterisk — p < .0001. 


rate (p < .0001). There was a significant improve- 
ment in speech perception from speech rate slowing 
for the cochlear implant users. The individual scores 
at each rate of speech are shown in Fig 2. There are 
3 groups for the effect of speech rate slowing: high 
degree of effect (A group), middle effect (B group), 
and poor effect (C group). 


We evaluated the effect of speech rate slowing as 
the improvement index, which is calculated as the 
score at the slow rate minus the score at the fast rate. 
The mean and standard deviation for the improve- 
ment index was 39.3% + 14.8%. The patients scored 
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Fig 2. Individual recall scores of cochlear implant users 
for each rate (fast, medium, and slow) of speech. Effects 
of speech rate slowing are classified into 3 groups: high 
degree of effect (A group), middle effect (B group), and 
poor effect (C group). 
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TABLE 2. CORRELATION BETWEEN EFFECT OF 
SPEECH RATE SLOWING (IMPROVEMENT INDEX) 
AND OTHER FACTORS 











Sex >.1 
Age at implantation >.1 
Duration of profound deafness >.1 
Duration of rehabilitation >.1 
Monosyllable test score S] 
Word test score <.005 
Score at fast speech rate <.005 


Improvement index (%) equals score at slow speech rate (%) minus 
score at fast speech rate (%). 





with an average accuracy of 49% (range, 34% to 
72%) on the monosyllable test and 66% (range, 40% 
to 88%) on the word test. Table 2 shows the results 
for the correlation between the improvement index 
and factors concerned with cochlear implantation, 
which are sex, age, duration of profound deafness, 
duration of rehabilitation, monosyllable test score, 
word test score, and score at fast speech rate. In par- 
ticular, there were significant (p < .005) correlations 
between the improvement index and the scores on 
the word test and at the fast speech rate. However, 
no significant effects of sex, age at implantation, du- 
ration of profound deafness, duration of rehabilita- 
tion, or monosyllable test score were observed (p > 
.1). The cochlear implant users who achieved a high 
recognition performance on the word test and at a 
fast speech rate had a significantly high improve- 
ment index score. These findings indicate that speech 
rate slowing is appropriate for those cochlear implant 
users who show good speech recognition. 


DISCUSSION 


Vowel, consonant, word, and sentence recognition 
tests, as well as speech tracking tests, are performed 
as part of a clinical evaluation of the multichannel 
cochlear implant.8°? Speech recognition ability is an 
important prognostic indicator of improvement in 
speech perception ability.!° Age, duration of deafness, 
processor strategy, good gap detection and frequency 
discrimination during preoperative promontory test- 
ing, large numbers of electrodes in use, and greater 
dynamic ranges for intracochlear electrical stimula- 
tion have been associated with the improvement of 
speech recognition. !!-!3 However, it is important not 
only to test the speech recognition ability of patients, 
but also to make some assessment of what the re- 
sults mean in terms of patient benefit. The process- 
ing of syntactic, semantic, and contextual informa- 
tion in the speech signal and the use of prosodic cues 
may be important for the improvement of speech 
perception. '4 Therefore, the efficient transmission of 
information from the cochlear prosthetic system to 


the central auditory processor has led to the improve- 
ment of speech perception. 


Rapid speech makes it difficult to transmit much 
information from the cochlear implant system to the 
central auditory processor. Age-related deficits in pe- 
ripheral auditory function, such as loss of hearing 
sensitivity and poor frequency selectivity, are well 
recognized as the cause for age-related hearing loss. 
Some reports* have demonstrated that speech slow- 
ing improves speech perception in older adults. The 
finding of an improvement in speech perception for 
cochlear implant users from the slowing of the speech 
rate was consistent with these reports. However, the 
improvements from fast to slow rate may occur for 
at least 3 different reasons: 1) fast speech may not 
be articulated as clearly as slow speech, reducing the 
quality of the phonetic information available in the 
fast recordings; 2) the speech processor may not ex- 
tract or encode as much information from the fast 
recordings because of the limited analysis time reso- 
lution, the temporal smoothing of parameters (band- 
width), and the pulse rate used to present informa- 
tion; 3) the listener may not be able to process all of 
the available perceptual information at the fast rate. 
The question as to which of the 3 explanations ac- 
counts for the differences in speech perception scores 
will require further research. 


Individual scores at each rate of speech showed 
that there were variations in the effect of speech rate 
slowing in cochlear implant users, and the effects 
are classified into 3 groups. The effect of speech rate 
slowing was high in cochlear implant users who 
achieved a high performance score at a fast speech 
rate. However, the effect was low in the cochlear im- 
plant users who achieved a poor performance at a 
fast speech rate. The present study showed a signifi- 
cant improvement in speech perception on speech 
rate slowing for all cochlear implant users, especially 
those who achieved high recognition performance 
scores on the word test and at the fast speech rate. 
However, there were no significant correlations be- 
tween the improvement in speech perception (the im- 
provement index) and sex, age at implantation, du- 
ration of profound deafness, duration of rehabilita- 
tion, or monosyllable test score. These findings sug- 
gest that speech rate slowing is more effective for 
cochlear implant users who have a good ability for 
syntactic and semantic processing through the cen- 
tral auditory system; in other words, speech rate slow- 
ing is likely to assist this ability. The speech percep- 
tion performance test with speech rate slowing is use- 
ful as a different test battery for the realistic assess- 
ment of the benefits to cochlear implant users. 


Speech rate slowing has been confirmed to be an 
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important factor for the improvement of speech per- 
ception in cochlear implant users in the present study. 
Recent improvements of digital signal processors 
have enabled real-time modification of speech sig- 
nals. Nejime et al!5 reported a real-time portable de- 





vice that slows speech speed. We expect the applica- 
tion of speech rate slowing to provide substantial 
benefits in communication in practice and training 
with the multichannel cochlear implant and an ad- 
vanced processor strategy. 
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TEMPORAL BONE MORPHOMETRIC STUDY ON THE EUSTACHIAN 
TUBE AND ITS ASSOCIATED STRUCTURES IN PATIENTS WITH 
CHROMOSOMAL ABERRATIONS 
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This study characterized phenotypic anomalies of the eustachian tube (ET) and its associated structures in individuals with 
various chromosomal aberrations (trisomies 13, 18, 21, and 22 and inversion of chromosome 1). The morphological characteristics 
of the ET and its accessory structures from 10 temporal bone-ET specimens, obtained from 10 individuals with chromosomal 
aberrations who ranged from the 26th gestational week to 1 year in age, were compared with the same structures from 21 age- 
matched control subjects without anomalies. The subjects with chromosomal aberrations had a significantly smaller volume of the 
lateral lamina (LL) of the ET cartilage, a reduced attachment of the tensor veli palatini muscle (TVPM), and, in some cases, a reduced 
volume of the lumen of the cartilaginous ET. The volume of the medial lamina (ML) of the ET cartilage was normal at birth, but 
smaller in specimens more than 2 months of age; this finding suggests that the prenatal development was normal, but that the 
postnatal growth was retarded. The subjects with chromosomal aberrations also displayed a smaller ratio of the volume of the LL to 
that of the ML (LL/ML ratio). Our results suggest that individuals with various chromosomal aberrations have rather similar anoma- 
lies of the ET and its associated structures. We speculate that these anomalies might be closely related to ET dysfunction in these 


patients. 


KEY WORDS — anomaly, chromosomal aberration, eustachian tube, histopathology, temporal bone. 


INTRODUCTION 


An understanding of the pathogenesis of eusta- 
chian tube (ET) dysfunction in patients with congeni- 
tal anomalies requires an understanding of the link- 
age between genetic mutations and phenotypic fea- 
tures of the ET and related structures. One important 
source of data for this endeavor is the comparative 
analysis of morphological differences of the ET and 
its associated structures between normal and anoma- 
lous cases. Although several histologic studies have 
evaluated the ET and its associated structures in nor- 
mal subjects,!~4 there is limited information describ- 
ing histopathologic abnormalities of the ET and its 
associated structures in cases of anomalies with a spe- 
cific genetic basis.>- 


Chromosomal aberrations such as trisomies 13, 18, 
21, and 22 are known to cause diverse congenital 
anomalies. For example, patients with various chro- 
mosomal aberrations have been reported to display 
a variety of external, middle, and inner ear anoma- 
lies,” including anomalies of the ET.? However, there 
has been no systematic survey to determine whether 
individuals with different chromosomal aberrations 
share acommon set of ET anomalies. Hence, the pur- 


pose of this study was to quantitatively evaluate anom- 
alies of the ET and its associated structures in pa- 
tients with trisomies 13, 18, 21, and 22 and inver- 
sion of chromosome | and to compare these features 
with the morphological findings from age-matched 
control subjects. 


MATERIALS AND METHODS 


The materials used for this study were 31 left tem- 
poral bone—-ET specimens obtained from 31 individu- 
als. Ten specimens were obtained from 10 individu- 
als who had multiple congenital anomalies with vari- 
ous chromosomal aberrations (trisomies 13, 18, 21, 
and 22 and inversion of chromosome 1), ranging in 
age from the 26th gestational week to | year old at 
death (see Table). Twenty-one age-matched control 
specimens were obtained from 21 individuals with- 
out any known congenital anomaly or histologic evi- 
dence of ear disease, ranging in age from the 26th 
gestational week to | year old at death. 


The specimens were fixed in 10% formalin, decal- 
cified in 5% trichloroacetic acid, dehydrated in graded 
concentrations of ethanol, and embedded in celloi- 
din. The celloidin blocks were sectioned at 30 um in 
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CASES OF CHROMOSOMAL ABERRATIONS 








Case Age Sex Chromosomal Aberration 
1 1d (26 gw) M Trisomy 21 
2 1d(34gw) M Trisomy 22 
3  1d(38 gw) M Trisomy 18 
4 4d(36gw) F Trisomy 13 

5 2 wk(35 gw) M Inversion of chromosome | 
6 2wk(fullterm) F Trisomy 13 
7 6wk(fullterm) F Trisomy 13 
8 3mo (fullterm) F Trisomy 22 
9 6mo (37 gw) M _ Trisomy 21 
10 l y (full term) F Trisomy 21 


gw — gestational weeks. 





thickness as close to perpendicular to the long axis 
of the ET as possible.!° Every 20th section was stained 
with hematoxylin and eosin and prepared for a light 
microscopic study and a computer-assisted measure- 
ment. The volume of the ET cartilage and lumen and 
the ratio of attachment of the tensor veli palatini mus- 
cle (TVPM) tothe ET cartilage were estimated in each 
case. 


Postnatal growth of the head is rapid in infants. 
For example, head circumference increases from 34 
to 35 cmat birth to approximately 40 cm by 3 months, 
43 cm by 6 months, and 46 to 47 cm by 1 year.!! To 
minimize the influence of postnatal growth on data 
analysis, each case was compared with age-matched 
controls. 


The equipment used for measurement consisted of 
a personal computer (NEC PC, 9801RL), a frame buf- 
fer (Sapience HyperFrame+) for full-color images, 


Fig 2. Photomicrographs of ver- 
tical section of left ET and its as- 
sociated structures obtained from 
2 control cases. A) One-month- 
old boy. B) Four-day-old boy 
(35th gestational week). Tendon- 
like membrane (arrows) of tensor 
veli palatini muscle (TVPM) is at- 
tached into perichondrium (*) at 
tip of lateral lamina (LL) of ET 
cartilage in each case. L — lumen 
of ET; LVPM — levator veli 
palatini muscle; ML — medial 
lamina of ET cartilage. Right, left, 
superior, and inferior sides of im- 
ages are anterolateral, posterome- 
dial, superior, and inferior direc- 
tions, respectively. 






Medial Lateral 


TVPM 


| Inferior 


Fig 1. Schematic drawing of cross section in midcarti- 
laginous portion of left eustachian tube (ET) and its as- 
sociated structures. Eustachian tube cartilage (ETC) is 
divided into lateral lamina (LL) and medial lamina (ML) 
by line connecting most superior and most inferior points 
(*) on margin of ET lumen (L). TVPM — tensor veli 
palatini muscle. 


a digitizer (Hitachi HDG-1515, resolution 0.1 mm), 
and the software developed by Takagi and Sando, !? 
modified by Fujita and Sando,'3 and revised by Sudo 
et al.3 


Calculation of Volume of ET Cartilage. First, the 
ET cartilage on every 20th section was divided into 
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Fig 3. Volume of eustachian tube cartilage (ETC) in both chromosomal aberration group (filled circles; Y1) and control group 
(empty circles; Y2) as function of age. Age in each case is adjusted by gestational week at birth, which means that l-day-old 
newborn at 40th gestational week is indicated as 0 months of age. Regression lines in both groups are also shown. A) Volume 
of medial lamina (ML) of ETC. B) Volume of lateral lamina (LL) of ETC. C) Ratio of volume of LL to that of ML (LL/ML 


ratio). D) Volume of total ETC. 


a lateral lamina (LL) and a medial lamina (ML) by a 
vertical line connecting the most superior and most 
inferior points on the margin of the ET lumen (Fig 
1). After the cross-sectional areas of the LL and ML 
were measured, the volumes of the LL and ML were 
computed as the sums (across sections) of the cross- 
sectional areas of the LL and ML, respectively. The 
total volume of the ET cartilage was also the sum 
(across sections) of the cross-sectional areas of the 
total ET cartilage. The ratio of the volume of the LL 
to that of the ML (LL/ML ratio) was also calculated. 


Calculation of Volume of ET Lumen. The pharyn- 
geal orifice of the ET lumen was defined as the most 
anterior section containing a complete cross section 
of the mucosal lining of the ET. The tympanic ori- 
fice of the ET lumen was defined similarly as the 
most posterior ET section in proximity to an ante- 


rior-superior margin of the tympanic annulus. The 
ET lumen was divided into 3 parts: the cartilaginous 
portion, the junctional portion surrounded partly by 
the cartilage and partly by the bone,’ and the bony 
portion. The pharyngeal margin of the junctional por- 
tion was defined as the most anterior site in which 
the bone surrounded the ET lumen. Similarly, the tym- 
panic margin of the junctional portion was defined 
as the most posterior site in which the ET cartilage 
was present. The volume in each part of the ET lu- 
men was calculated as the sum of the cross-sectional 
areas of all of the sections from each part of the ET 
lumen. The total volume of the ET lumen was de- 
fined as the sum of the cross-sectional areas from all 
of the sections through the ET lumen. 


Calculation of Ratio of Attachment of TVPM to 
ET Cartilage. The site of attachment of the TVPM 
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Fig 4. Photomicrograph of vertical section of left 
ET and its associated structures obtained from 1- 
day-old boy (case 2) in chromosomal aberration 
group. Lateral lamina of ET cartilage is absent. 
Furthermore, tensor veli palatini muscle (TVPM) 
is not attached to ET cartilage. Cross section of 
ET lumen (L) shows extremely slitlike shape with- 
out curvature. LVPM — levator veli palatini 
muscle; ML — medial lamina of ET cartilage. 
Right, left, superior, and inferior sides of image 
are anterolateral, posteromedial, superior, and in- 
ferior directions, respectively. 


to the ET cartilage was defined by the insertion of 
the tendonlike membrane of the TVPM into the peri- 
chondrium at the tip of the LL of the ET cartilage!4 
(Fig 2). The attachment of the TVPM was evaluated 
on every 20th section to determine its length along 
the long axis of the ET. The attachment was also ex- 
pressed as the ratio of the length of the TVPM at- 
tachment to the length of the ET cartilage. 


RESULTS 


Volume of ET Cartilage. The volume of the ML of 
the ET cartilage increased linearly with age in both 
the control subjects (r? = 0.728; F(1,19) = 50.822; p 
< .01) and the subjects with chromosomal aberra- 
tions (r? = 0.792; F(1,8) = 30.393; p < .01; Fig 3A). 
Analysis of variance (using age as a covariate) of all 
data revealed that the group with chromosomal ab- 
errations had a smaller ML than did the control sub- 
jects (F(1,28) = 5.349; p < .05). This difference was 
attributed to a hypoplastic ML in the 3 subjects great- 
er than 2 months of age (cases 8, 9, and 10); the vol- 
umes of the ML (mean + SD) in younger subjects 
(less than 2 months) from the control (46.93 + 14.99 
mm->) and chromosomal aberration (36.34 + 10.71 
mm?) groups did not differ significantly. These find- 
ings suggest that there may be normal prenatal but 
decreased postnatal development of the ML in the 
cases of chromosomal aberrations. 


The volume of the LL increased linearly with age 
in only the control specimens (r? = 0.452; F(1,19) = 
15.678; p < .01); the age relationship for the chromo- 
somal aberration group was not significant (r? = 
0.103; F(1,8) = 0.915; not significant; Fig 3B). The 
LL volume in the chromosomal aberration group (for 
all of the specimens, 4.99 + 4.27 mm?) was signifi- 
cantly less than the LL volume in either the entire 
control group (14.09 + 7.95 mm?; separate variance 
(28.4) = —4.14; p < .01) or the control subjects less 





— 


= 


TVPM 
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than 2 months of age (11.50 + 4.49; separate vari- 
ance t(19.8) = —3.75; p < .01). The differences were 
particularly striking in case 2, because the LL was 
absent along the entire length of the ET cartilage (Fig 
4). It is important to note, though, that the single sub- 
ject with inversion of chromosome | (case 5) had a 
normal LL volume. However, these findings suggest 
that a hypoplastic LL was a common phenotypic fea- 
ture in trisomies of chromosomes 13, 18, 21, and 22. 


The ratio of the volume of the LL to the volume of 
the ML (LL/ML ratio) provides a gauge of the rela- 
tive volumes of these portions of the ET cartilage. 
Regression analysis indicated that the LL/ML ratios 
did not vary significantly with age in either the con- 
trol group or the group with chromosomal aberra- 
tions (Fig 3C). As expected from the findings above, 
the LL/ML ratio was significantly smaller (separate 
variance; t(18.5) = —4.35; p < .01) in the group with 
chromosomal aberrations (0.103 + 0.080) than in the 
control group (0.238 + 0.083). It is also important to 
note that the sole subject with inversion of chromo- 
some | (case 5) had a normal LL/ML ratio. 


The total volume of the ET cartilage also increased 
linearly with age in both the control subjects (r? = 
0.701; F(1,19) = 44.535; p < .01) and the subjects 
with chromosomal aberrations (r? = 0.743; F(1,8) = 
23.100; p < .01; Fig 3D). Analysis of variance (with 
age as a covariate) indicated that the total ET carti- 
lage volume was significantly smaller in the group 
with chromosomal aberrations (F(1,28) = 7.97; p < 
.01). Because the same analysis revealed no signifi- 
cant difference in the specimens less than 2 months 
of age (F(1,21) = 3.318; p > .05), the smaller size 
probably reflects decreased postnatal growth in the 
latter group. 


Volume of ET Lumen. The lumen volume of the 
cartilaginous portion of the ET increased significantly 
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Fig 5. Volume of ET lumen in both chromosomal aberration group (filled circles; Y1) and control group (empty circles; Y2) as 
function of age. Age in each case is adjusted by gestational week at birth. Regression lines in both groups are also shown. A) 
Volume of lumen in cartilaginous portion of ET (CL volume). B) Volume of lumen in junctional portion of ET (JL volume). C) 
Volume of lumen in bony portion of ET (BL volume). D) Volume of total lumen of ET. 


with age in only the control group (r? = 0.651; F(1,19) 
= 35.432; p < .01); the age relationship for the chro- 
mosomal aberration group was not significant (r? = 
0.376; F(1,8) = 4.821; p = .0594; Fig SA). Analysis 
of variance (with age as a covariate) indicated that 
the lumen of the cartilaginous portion was smaller 
in the chromosomal aberration group than in the con- 
trol group (F(1,28) = 4.81; p < .05). However, 3 of 
the cases of chromosomal aberrations showed vol- 
umes within the normal range: 2 trisomy 13 cases 
(cases 4 and 6) and | trisomy 21 case (case 10). Nei- 
ther the volume of the lumen of the bony ET (F(1,28) 
= 0.61; not significant) nor the volume of the lumen 
of the junctional region of the ET (F(1,28) = 0.29; 
not significant) differed significantly between groups 
(Fig 5B,C). Finally, analysis of variance (with age 
as a covariate) revealed no significant intergroup dif- 
ference in the total volume of the ET lumen (F(1,28) 


= 3.75; p = .063; Fig 5D). 


TVPM Attachment Ratio. Despite the presence of 
a well-developed TVPM, 5 subjects with chromoso- 
mal aberrations (cases 1, 2, 4, 6, and 8) lacked an 
attachment of the TVPM to the ET cartilage in every 
section (Figs 4 and 6), whereas all control subjects 
displayed a TVPM attachment (Fig 2). Regression 
analysis indicated that the TVPM attachment ratio 
did not vary significantly as a function of age in ei- 
ther group (Fig 7). The ratios were 0.111 + 0.142 in 
the chromosomal aberration group and 0.358 + 0.105 
in the control group. These ratios differed significant- 
ly (separate variance (13.9) = —4.91; p < .01). It is 
important to note, though, that 2 anomaly cases had 
TVPM attachment ratios in the normal range: the 
chromosome | inversion case (case 5) and a trisomy 
18 case (case 3). Finally, there was a significant re- 
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Fig 6. Photomicrograph of vertical section of left ET and 
its associated structures obtained from 2-week-old girl 
(case 6) in chromosomal aberration group. Tensor veli 
palatini muscle (TVPM) is not attached (*) to poorly 
developed lateral lamina (LL) of ET cartilage. L — lu- 
men of ET; LVPM — levator veli palatini muscle; ML 
— medial lamina of ET cartilage. Right, left, superior, 
and inferior sides of image are anterolateral, posterome- 
dial, superior, and inferior directions, respectively. 


lationship between the TVPM attachment ratio and 
the LL volume (r? = 0.505; p < .01 for all data). 


DISCUSSION 


Chromosomal aberrations occur in 0.4% of live 
births. Among them, trisomy 21 is the most com- 
mon autosomal aberration, with an incidence in the 
general population of 1 in 600 to 800 live births. The 
incidences of trisomies 18 and 13 are approximately 
1 in 6,000 and 1 in 10,000 live births, respectively. !5 
Although trisomy 22 is considered to be the second 
most common autosomal trisomy found in spontane- 
ous abortions,!© liveborn infants with this trisomy 
have rarely been reported.!7:!8 Some chromosomal 
inversions can give rise to the birth of children with 
abnormalities resulting from duplications and defi- 
ciencies of a recombinant chromosome, and a few 
reports have been published regarding inversion of 
chromosome 1.!9 Most individuals with these chro- 
mosomal aberrations exhibit consistent and specific 
phenotypes depending on the involved chromosomes. 
However, individuals with chromosomal anomalies 
often share several similar clinical features, includ- 
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Fig 7. Ratio of attachment of tensor veli palatini muscle 
(TVPM) to eustachian tube cartilage (TVPM attachment 
ratio) in both chromosomal aberration group (filled 
circles; Y 1) and control group (empty circles; Y2) as func- 
tion of age. Age in each case is adjusted by gestational 
week at birth. Regression lines in both groups are also 
shown. 


ing congenital heart disease, mental retardation, and 
low-set and malformed ears.!> Our present study re- 
vealed that these features are also accompanied by a 
hypoplastic LL of the ET cartilage and a reduced 
TVPM attachment to the LL. The majority of the 
specimens also showed a reduced volume of the lu- 
men in the cartilaginous portion of the ET. 


The temporal bones from individuals with chromo- 
somal aberrations showed significant reductions in 
LL volume and in TVPM attachment ratio (compared 
to age-matched controls). These results confirmed 
that the volume of the ET cartilage (especially the 
LL) is significantly smaller in subjects with chromo- 
somal aberrations than in control subjects. Because 
there was no marked difference in the volume of the 
ML of the ET cartilage, the marked reduction in the 
LL volume resulted in abnormally small LL/ML ra- 
tios in the subjects with chromosomal aberrations. 
The concomitant reduction in the attachment of the 
TVPM, the primary dilator of the ET,2°*! has not 
been reported previously. Contraction of the TVPM, 
in normal individuals, seems to move the LL of the 
ET cartilage away from the ML in a hingelike ac- 
tion.! Therefore, poor attachment of the TVPM to 
the LL of the ET cartilage, as well as a small LL, is 
easily expected to result in an ineffective ET dila- 
tion when the contraction occurs. It is important to 
note that 5 of the 10 subjects (cases 1, 2, 4, 6, and 8) 
with chromosomal aberrations had no attachment of 
the TVPM to the ET cartilage. This result suggests 
that inadequate ET dilation in these individuals may 
be attributed to similar phenotypic anomalies of the 
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LL-TVPM complex. This result is consistent with 
the previous report by White et al?? demonstrating 
that 95% of patients with Down syndrome could not 
reduce negative middle ear pressure by swallowing. 


The total volume of the ET lumen did not exhibit 
a marked difference between individuals with chro- 
mosomal aberrations and control cases. However, the 
lumen volume in the cartilaginous ET was reduced 
in individuals with chromosomal aberrations. Suzuki 
et al? reported that the ET lumen of the cartilaginous 
portion in children is narrow and tubular. They con- 
cluded that the normal lumen of the cartilaginous ET 
has limited capacity for dilation in children. Hence, 
children with chromosomal aberrations may have the 
combined disadvantages of both a reduced lumen in 
the cartilaginous ET and anomalies of the LL-TVPM 
complex. 


The ET is believed to be derived from the expand- 
ing terminal end of the endoderm-lined first and sec- 
ond pharyngeal pouches.?3 Azcording to Tos,”4 the 
ML of the ET cartilage begins to develop at the end 
of the 12th gestational week, whereas the LL starts 
developing during the 16th gestational week. The ET 
cartilage attains the final proportions of the LL and 
ML during the 20th gestational week. On the other 
hand, the TVPM is well developed by the end of the 
11th gestational week. Therefcre, a poorly developed 
LL of the ET cartilage in the subjects with chromo- 
somal aberrations may suggest an interruption that 





occurred earlier than the 20th gestational week. 


Hearing impairment has been reported to be preva- 
lent among individuals with chromosomal aberra- 
tions, including trisomy 21. These individuals are also 
highly susceptible to otitis media with effusion.2° 
Consistent with these clinical observations, Bilgin 
et al?6 reported that 8 of 16 temporal bone speci- 
mens obtained from individuals with trisomy 21 ex- 
hibited otitis media, and Sando et al?” reported that 
temporal bone specimens obtained from individuals 
with trisomy 13 exhibited otitis media in all cases. 
Dysfunction of the ET has been believed to be one 
of the most important etiologic factors in otitis me- 
dia with effusion. The various ET abnormalities ob- 
served in our study are likely to contribute to ET 
dysfunction. 


This study revealed a consistent hypoplasia of the 
LL of the ET cartilage in association with a variety 
of chromosomal aberrations. The specimens used in 
this study represented 3 cases of trisomy 13, 3 cases 
of trisomy 21, 2 cases of trisomy 22, 1 case of tri- 
somy 18, and 1 case of inversion of chromosome 1. 
Further studies using more cases of each type of chro- 
mosomal aberration are required to determine wheth- 
er specific chromosomal aberrations may produce 
specific subtypes of manifestation of anomalies of 
the ET. However, our results suggest that the gene 
that affects development of the LL of the ET carti- 
lage may be located on multiple chromosomes. 


ACKNOWLEDGMENTS — The authors express deep gratitude to Eugene N. Myers, MD, Professor and Chairman of the Department of Oto- 
laryngology, University of Pittsburzh School of Medicine, and Charles D. Bluestone, MD, Professor of Otolaryngology, University of Pitts- 
burgh School of Medicine, and Director, Department of Pediatric Otolaryngology, Children’s Hospital of Pittsburgh, for their encouragement to 
study this subject. Appreciation is zlso expressed to Joyce Ann Horner for her assistance in this study. 


REFERENCES 


1. Sando I, Takahashi H, Matsune S, Aoki H. Localization 
of function in the eustachian tube: a hypothesis. Ann Otol Rhinol 
Laryngol 1994; 103:311-4. 


2. Suzuki C, Balaban C, Sando I, Ganbo T, Kitagawa M. 
Postnatal development of eustachiar tube: a computer-aided 3- 
D reconstruction and measuremen: study. Acta Otolaryngol 
(Stockh) 1998; 118:837-43. 


3. Sudo M, Sando I, Ikui A, Suzuki C. Narrowest (isthmus) 
portion of eustachian tube: a computer-aided three-dimensional 
reconstruction and measurement study. Ann Otol Rhinol Laryn- 
gol 1997; 106:583-8. 


4. Sadler-Kimes D, Siegel MI, Todhunter JS. Age-related 
morphologic differences in the components of the eustachian 
tube/middle ear system. Ann Otol Rhinol Laryngol 1989;98:854- 
8. 


5. Shibahara Y, Sando I. Histopathologic study of eustachi- 
an tube in cleft palate patients. Ann Gtol Rhinol Laryngol 1988; 
97:403-8. 

6. Yamaguchi N, Sando I, Hashida Y, Takahashi H, Matsune 
S. Histologic study of eustachian tube cartilage with and with- 


out congenital anomalies: a preliminary study. Ann Otol Rhinol 
Laryngol 1990;99:984-7. 


7. Schuknecht HF. Pathology of the ear. 2nd ed. Philadel- 
phia, Pa: Lea & Febiger, 1993:115-90. 


8. Sakashita T, Sando I, Kamerer DB. Congenital anoma- 
lies of the external and middle ear. In: Bluestone CD, Stoll SE, 
Kenna MA, eds. Pediatric otolaryngology. 3rd ed. Philadelphia, 
Pa: WB Saunders, 1996:333-70. 


9. Shibahara Y, Sando I. Congenital anomalies of the eusta- 
chian tube in Down syndrome. Histopathologic case report. Ann 
Otol Rhinol Laryngol 1989;98:543-7. 


10. Sando I, Doyle WJ, Okuno H, Takahara T, Kitajiri M, 
Coury WJ II. A method for the histopathological analysis of 
the temporal bone and the eustachian tube and its accessory 
structures. Ann Otol Rhinol Laryngol 1986;95:267-74. 


11. Neediman RD. The first year. In: Behrman RE, Klieg- 
man RM, Jenson HB, eds. Nelson textbook of pediatrics. 16th 
ed. Philadelphia, Pa: WB Saunders, 2000:32-8. 


12. Takagi A, Sando I. Computer-aided three-dimensional 
reconstruction: a method of measuring temporal bone structures 
including the length of the cochlea. Ann Otol Rhinol Laryngol 
1989;98:515-22. 


13. Fujita S, Sando I. Postnatal development of the vestibu- 
lar aqueduct in relation to the internal auditory canal. Compu- 





a 





r 


Miura et al, Eustachian Tube Anomaly 729 


ter-aided three-dimensional reconstruction and measurement 
study. Ann Otol Rhinol Laryngol 1994;103:719-22. 


14. Matsune S, Sando I, Takahashi H. Insertion of the tensor 
veli palatini muscle into the eustachian tube cartilage in cleft 
palate cases. Ann Otol Rhinol Laryngol 1991;100:439-46. 


15. Hall JG. Chromosomal clinical abnormalities. In: Behr- 
man RE, Kliegman RM, Jenson HB, eds. Nelson textbook of 
pediatrics. 16th ed. Philadelphia, Pa: WB Saunders, 2000:325- 
33. 


16. Warburton D, Stein Z, Kline J, Susser M. Chromosome 
abnormalities in spontaneous abortion: data from the New York 
City study. In: Porter TH, Hook EB, eds. Human embryonic and 
fetal death. New York, NY: Academic Press, 1980:261-87. 

17. Crowe CA, Schwartz S, Black CJ, Jaswaney V. Mosaic 
trisomy 22: a case presentation and literature review of trisomy 
22 phenotypes. Am J Med Genet 1997;71:406-13. 

18. McPherson E, Stetka DG. Trisomy 22 in a liveborn in- 
fant with multiple congenital anomalies. Am J Med Genet 1990; 
36:11-4. 

19. Martin RH, Chernos JE, Lowry RB, Pattinson HA, Bar- 
clay L, Ko E. Analysis of sperm chromosome complements from 
a man heterozygous for a pericentric inversion of chromosome 
1. Hum Genet 1994;93;135-8. 

20. Honjo I, Okazaki N, Kumazawa T. Experimental study 
of the eustachian tube function with regard to its related muscles. 


Acta Otolaryngol (Stockh) 1979;87:84-9. 


21. Cantekin EI, Doyle WJ, Reichert TJ, Phillips DC, Blue- 
stone CD. Dilation of the eustachian tube by electrical stimula- 
tion of the mandibular nerve. Ann Otol Rhinol Laryngol 1979; 
88:40-51. 


22. White BL, Doyle WJ, Bluestone CD. Eustachian tube 
function in infants and children with Down’s syndrome. In: Lim 
DJ, Bluestone CD, Klein JO, Nelson JD, eds. Recent advances 
in otitis media with effusion. Proceedings of the Third Interna- 
tional Symposium. Philadelphia, Pa: BC Decker, 1984:62-6. 


23. Pearson AA. Developmental anatomy of the ear. In: En- 
glish M, ed. Otolaryngology. New York, NY: Harper and Row, 
1978:1-68. 


24. Tos M. Growth of the foetal eustachian tube and its di- 
mensions. Arch Klin Exp Ohren Nasen Kehlkopfheilkd 1971; 
198:177-86. 


25. Strome M. Down’s syndrome: a modern otorhinolaryngo- 
logical perspective. Laryngoscope 1981;91:1581-94. 


26. Bilgin H, Kasemsuwan L, Schachern PA, Paparella MM, 
Le CT. Temporal bone study of Down’s syndrome. Arch Oto- 
laryngol Head Neck Surg 1996;122:271-5. 


27. Sando I, Leiberman A, Bergstrom L, Izumi S, Wood RP 
Il. Temporal bone histopathological findings in trisomy 13 syn- 
drome. Ann Otol Rhinol Laryngol Suppl 1975;84(suppl 21). 


Ann Otol Rhinol Laryngol 111:2002 


EVALUATION OF VERTIGO BY AUDITORY BRAIN STEM RESPONSE 
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The authors examined the hypothesis that abnormal patterns of the auditory brain stem response (ABR) could supplement the 
neuro-otological evaluation and assist in localizing the site of vestibulocerebellar dysfunction. This project is based upon the fact that 
the sources of waves I through V have been regionally identified. Absent or delayed patterns can be referenced to the normal data, 
and the site of a lesion generating vertigo can be established. We found absence of waves or prolonged interpeak latencies in 25% of 
the vertiginous subjects with norma! hearing and magnetic resonance images of the brain. We conclude that in selected cases, lesions 
affecting the vestibular system can influence the ABR, and the electrophysiological tests of audition may suggest regionalization of 


the dysfunction in the hindbrain and midbrain. 


KEY WORDS — auditory brain stem response, vertigo. 


INTRODUCTION 


We investigated the hypothesis that the data from 
the early auditory brain stem response (ABR) can be 
used to localize the region within the brain stem pro- 
ducing vertigo. This concept is based upon the ana- 
tomic proximity between the central auditory path- 
ways and the vestibulocerebellar nuclei and tracts. 
Evidence that confirms focal abnormalities in the 
electrical transmission between the cochlea and the 
medial geniculate nucleus may be correlated with a 
regional disturbance resulting in the symptoms of a 
vestibular dysfunction. 


The classic differentation of peripheral versus cen- 
tral vertigo is based on the constellation of support- 
ive neurologic data and appropriate laboratory inves- 
tigations. “Syndromal vertigo,” a composite of sev- 
eral motor and sensory deficits, is readily located in 
the hindbrain.! 


Further correlation was attempted between mag- 
netic resonance imaging (MRI), magnetic resonance 
angiography (MRA), electronystagmography (ENG), 
and the clinical examination. 


ORIGIN OF EARLY-LATENCY, FAR-FIELD ABR 


In the following section, we briefly summarize the 
relevant historical data that indicated the neuronal 
sources of the waves recorded by the ABR test. Indi- 
viduals with a variety of neurologic disorders were 
evaluated, and a correlation was established between 
the clinical syndrome, abnormalities of the ABR 
waveform, and computed tomography.?3 


The waveforms are considered to be a composite 
of 1) the synchronous firing of parallel axon bundles 


at fixed locations and 2) the neural outflow from the 
regional nuclei. The physiologic components of a 
propagating wave, the nerve action potential, and oth- 
er features of transmission are discussed by Sohmer.* 


Current data derived from human investigations 
suggest that multiple generators may be involved in 
the electrical activity represented by early waves III, 
IV, and V.4 Wave I appears to be a reflection of the 
action potential of the auditory nerve, which is initi- 
ated at the first node of Ranvier in the cochlea. The 
subsequent wave II is considered to originate from 
the central portion of the auditory nerve. Wave III 
emanates mainly from the cochlear nucleus; the fol- 
lowing negative peaks between III and IV have their 
genesis from the cochlear nucleus complex. The con- 
figuration of wave IV is variable, and the actual neu- 
ral generators are poorly understood. The source of 
wave V appears to be the termination of the lateral 
lemniscus in the inferior colliculus. The slow nega- 
tive shift following wave V represents dendritic po- 
tentials from the inferior colliculus. Because waves 
IV and V are often fused in normal subjects, the in- 
dividual amplitude values may not be measurable. 
An overview of the waveforms and the loci of known 
sources suggests that wave I measures the periph- 
eral hearing and that the brain stem is assessed by 
the following waves, eg, waves III through V.56 


CRITERIA FOR INTERPRETATION OF FAR-FIELD, 
EARLY-LATENCY ABR 


Starr and Achor’ reported that the normal ratio of 
the amplitudes of waves IV-V to wave I is 0.5 to 1.0, 
whereas a ratio of less than 0.5 suggests a central 
pathway disorder. Prolonged interpeak latency (IPL) 
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values in excess of 1 ms also suggest a central prob- 
lem, particularly in the later waveforms, such as IN 
to V. Rosenhall et al® considered the latency of wave- 
form V more important and significant than that of 
waves I and III; however, this proposition is limited 
by the variability of waveform V. Goldie commented 
that “the only measurements that seem to be of value 
are the absolute and interpeak measurements I, II], 
and waves V and the amplitude ratio of I to V.”9(P402) 
Markand!° suggested criteria that indicate an abnor- 
mal ABR: 1) absence of all waveforms (excluding 
severe hearing loss); 2) absence of waves following 
I, II, and III; 3) abnormally prolonged IPLs of waves 
I-III, HI-V, and I-V (that are not due to middle ear 
disease or inner ear dysfunction); and 4) an abnor- 
mally diminished V/I amplitude ratio. A simpler clas- 
sification was presented by Chandrasekhar et al,!! 
who suggested that the results were either normal or 
abnormal, and that the anomalous data could be ei- 
ther unilateral or bilateral. Additional criteria included 
an interaural difference greater than 0.2 ms, and ab- 
sence or abnormalities of waveforms. 


The authors concur with this position and propose 
the following, which are based on the clinical corre- 
lation derived from several thousand normal and neu- 
rologically impaired subjects: 1) failure of repeatabil- 
ity of the waveforms; 2) absence of wave II follow- 
ing anormal wave I; 3) failure of registration of wave 
V (including the combined IV-V complex); and 4) 
interaural discrepancy of wave V latency greater than 
0.3 ms. Finally, the general morphology and identi- 
fication of waves III and V (or IV-V complex) are 
the most important configurations in the ABR, where- 
as relative amplitudes are qualitative and of limited 
diagnostic value. 


CLINICAL APPLICATION AND CORRELATION OF 
ABNORMAL WAVEFORMS OF EARLY ABR 


The following material is an overview of the early 
and more current electrophysiological research that 
correlated abnormalities of the ABR pattern with con- 
firmed and well-identified neurologic disease. As a 
corollary to this information, the diagnostic role of 
the auditory evoked responses is defined for clinical 
use. 


The authors have deleted many articles of histor- 
ical importance from this review because of spatial 
constraints. We comment, however, that the abstracted 
reports are sufficient for interpretation of the subse- 
quent data. 


Pathological mechanisms that affect evoked po- 
tentials include 1) compression of the auditory nerve 
and brain stem, 2) vascular impairment of the periph- 
eral cochleovestibular nerve, and transient or pro- 


longed hypoperfusion of the intra-axial pathways, 
and 3) damage of myelin consequent to primary neu- 
rologic disorders such as multiple sclerosis.!° 


In a few cases, absence of ABR activity has been 
reported in normally hearing individuals and those 
with minor impairments. !2-14 An explanation for this 
apparent conflict is predicated upon interruption of 
the synchronous neural flow between generators. 
Other theories that attempt to resolve this dilemma 
include 1) absent neural activity, cause unknown, 2) 
a blockade of nerve conduction based upon local dys- 
function, and 3) asynchronous responses or disrup- 
tion of synchronous impulses below measurement. 
For example, specific diseases such as multiple scle- 
rosis may cause selective demyelination and neuro- 
humoral dysfunction of synapses, which distorts the 
electrical activity. Other central nervous system dis- 
orders that result in expressive or receptive aphasia 
may additionally be associated with abnormal ABR 
data.!5 


Ischemic attacks of the vertebrobasilar arterial sys- 
tem are frequently associated with abnormalities in 
the IPLs or amplitude ratios.8-16-18 The data are some- 
what variable and extend from 37% to 65% abnor- 
mal.}617 Ferbert et al!’ localized vertebrobasilar ar- 
tery thrombosis by angiography and integrated the 
evoked potential with the radiographic data. They 
concluded that the absence of wave I, excluding pre- 
vious peripheral auditory damage, is a consequence 
of malperfusion through the anterior inferior cerebel- 
lar artery. Thereafter, prolonged I-III latencies were 
observed in caudal lesions of the basilar artery. Fur- 
ther along, a pathological IV-V complex was observed 
when the lesion was rostral to the anterior inferior 
cerebellar artery. They further emphasized the vari- 
ability of the ABR findings in the scenario of bilateral 
vertebrobasilar artery occlusion. These inconsistent 
findings can be due to extension, occlusion, or com- 
promise of the middle and upper basilar arteries, 
which result in a unique and inconsistent involvement 
damaging the pontine region. 


From the negative aspect, an attempt by Kraus et 
al!3 to localize the central site of hearing loss was 
limited by such a variety of data that little specific- 
ity resulted. By contrast, a report by Starr and Ham- 
ilton!2 that included data from cerebral damage due 
to neoplasm, infarction, and the encephalopathy of 
anoxia described a more precise localization of the 
lesion affecting the ABR. An ABR correlation with 
localized unilateral brain stem lesions confirmed by 
MRI was assessed by Markand et al.?° They deter- 
mined that the abnormalities were ipsilateral to the 
corresponding ear in 18 of 24 subjects, and some de- 
gree of lateralization could be established. The mes- 
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Fig 1. Responses from left brain stem neural activity are 
consistent with normal parameters. Neural activity from 
right ear is not measurable throughout 0- to 10-ms time 
frame. 


encephalic lesions, although thzy affect wave IV and 
V either by amplitude depress:on or IPL delays, are 
less well lateralized. 


METHODS 


The authors recognize the voluminous literature 
that defines and measures the early auditory evoked 
responses. We suggest several articles that provide a 
foundation for the current report.23.5-7,10,19,21,22 The 
techniques and equipment are briefly outlined in the 
following summary. 


The ABR test used the Bio-Logic Traveler LT Sys- 
tem (Mundelein, Illinois) and Telephonic TDH-39 
headphones (Newark, New Jersey). The study was 
performed in a quiet area in the reclining position 
and without sedation after the audiometric evalua- 
tion. The specifics of the test are those generally ac- 
cepted and are briefly summarized for completeness. 
We used a filtered click of 100-ms duration with am- 
plifier filtration setting (of 300 Hz to 3 kHz) and pre- 
sented at 90 dB (hearing level) at a rate of 19.1 Hz/s 
and in runs of 2,048 stimuli. 


DATA 


The data were derived from 52 individuals (28 men 
and 24 women) 45 to 84 years of age (mean, 61.1 
years). The age distribution by decade is as follows: 
6 patients in their 40s, 9 in their 50s, 17 in their 60s, 
19 in their 70s, and 1 in his 8Cs. The criteria for en- 
try into this study included 1} 1 or more acute ver- 
tiginous episodes without neurologic evidence of per- 
sistent disability; 2) normal neurologic findings; 3) 
audiologic survey with requirements of bilateral sym- 
metry of pure tone sensitivity (30 dB or better), and 
speech discrimination scores proportionate to the im- 
pact of high-frequency attenuation of presbycusis, 
ie, >76%; 4) electronystagmography; and 5) evalua- 
tion of the brain by MRI and oF the cerebral vascula- 
ture and cervical vessels by MRA. 
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Fig 2. Upper pattern is derived from right ear, and lower 
two runs from left. There are no early waveforms; ques- 
tionable activity consistent with wave V may be suggested 
by vertical hatch mark. 


Factors that precluded entry into the study included 
central nervous system neoplasms, radiographically 
confirmed vascular abnormalities compressing the 
brain stem, eg, basilar dolichoectasia and aneurys- 
mal malformations, asymmetric hearing loss with a 
greater than 15-dB difference, and speech discrimina- 
tion of less than 76%. When these criteria were ap- 
plied to the large pool of vertiginous individuals ex- 
amined in the 3 years of the investigation, the final 
52 subjects represented approximately 5% of the to- 
tal cohort surveyed for vestibular complaints. The 
typical subject in this investigation was considered 
reliable by the attending physician, was unimpaired 
between vertiginous episodes, and was functional 
with socially adequate levels of hearing. The abnor- 
malities of the ABR are listed according to the fol- 
lowing categories: 1) amorphous pattern (ie, no dis- 
cernible waveforms), noted unilaterally in 2 subjects 
and bilaterally in an additional 2 subjects; 2) absence 
of waves III, IV, and V, noted unilaterally in 1 sub- 
ject and bilaterally in 2 subjects; 3) absence of waves 
IV and V, recorded unilaterally in 1 subject and bi- 
laterally in 2 subjects; and 4) interaural latencies 
(wave V) that were delayed, by 0.38 ms in 1 subject, 
by 0.5 ms in 1 subject, and by 0.8 ms in 1 subject. 


ILLUSTRATIVE CASES 


Case 1 (Fig 1) was that of a 67-year-old insulin- 
dependent diabetic man who had an acute incapaci- 
tating vertiginous attack associated with nausea and 
vomiting. He was examined 48 hours after the epi- 
sode, at which time an ataxic gait with lateropulsion 
was noted; a rotary-type nystagmus was apparent in 
all fields of gaze. He was fully alert, communicated 
without limitation, and was relatively comfortable 
while sedentary. Auditory tests revealed symmetric 
presbycusis with satisfactory verbal discrimination. 


Case 2 (Fig 2) was that of an 83-year-old man who 
complained of positionally induced intermittent epi- 
sodes of vertigo within the previous 2 years. Tests for 
hearing levels revealed mild presbycusis (speech re- 
ception threshold of 25 dB sound pressure level on 
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the right and left), and the speech discrimination 
scores were 84% on the right and 80% on the left. 


DISCUSSION 


Vertigo is a common symptom of peripheral ves- 
tibular dysfunction that may spare the auditory sys- 
tem. Included in this category are several syndromal 
or isolated disorders such as vestibular Meniere’s dis- 
ease, recurrent vestibulopathy, vestibular neuritis, and 
neurovascular compression.23-25 Other disorders, such 
as Lermoyez’s syndrome and Tumarkin-type otolithic 
crises, may represent an atypical presentation of Me- 
niere’s disease without a cochlear component. The 
authors emphasize that a peripheral source of ver- 
tigo need not affect the ABR, because wave I, the 
measure of cochlear activity, is usually normal in this 
scenario. 


The central causes of vertigo may include the retro- 
cochlear areas in the brain stem and cerebellum. The 
causative factors encompass such disparate disorders 
as vascular-based infarction, neoplasm, inflamma- 
tory disorders, and demyelinating diseases. Obvious- 
ly, because the cerebellum is not assessed by the 
ABR, this study is not applicable to lesions in this 
region that can initiate vertigo. 


The abnormal responses cited in the data may be 
divided by the presence or absence of waves (unilat- 
eral or bilateral) and interaural peak delays, ie, inter- 
ference with neural transmission. The 13 subjects 
who failed the ABR exhibit 4 types of wave deficits 
and IPL differences extending from 0.38 to 0.8 ms. 
The majority of vertiginous subjects (75% in this 
study) demonstrated normal ABRs. We speculate that 
some of the vertiginous subjects with normal ABR 
patterns suffer peripheral vestibular or cerebellar dis- 
orders that do not have an impact upon the auditory 
tracts. The remaining cases in this cohort may repre- 
sent central lesions that are functionally limited to 
the vestibular tracts. 


In 4 of the 13 cases, there is either unilateral or 
bilateral absence of recorded activity. This evidence 
suggests failure at the initial level that eliminates cen- 
tral propagation along the ascending pathways. The 
waves following normal I and H, ie, HI to V inclu- 
sive, are not recorded in 3 of the 13 cases. This ob- 
servation implies an electrophysiological interrup- 
tion in the retrocochlear region adjacent to or directly 
involving the cochlear nuclei. Absence of wave IV- 
V is evident in 3 subjects with a normal early con- 
figuration, suggesting an interruption in the midbrain 
region. We also suspect that the underlying mecha- 
nism for bilateral versus unilateral impairment is con- 
tingent upon the more extensive lesional area in the 
former and normal activity on the responsive side in 


the latter. Having eliminated obvious structural dam- 
age by the MRI, we suggest that the pathophysiologi- 
cal mechanism for many of these vertiginous epi- 
sodes is transient vascular insufficiency in the micro- 
vasculature of the hindbrain. The volumetric flow, 
however, to the auditory neural complex is sufficient 
to allow normal hearing. This concept does not deny 
other potential sources of vestibular dysfunction such 
as viral infection, but corroborative evidence is rela- 
tively sparse. 


We reiterate the relationship of specific arteries in 
the brain stem with regional neural structures and the 
consequent waveforms for localization: waves I and 
Il, from the cochlea and nerve VII (internal auditory 
artery from anterior inferior cerebellar artery or basi- 
lar artery); wave III, in the pontomedullary region 
(anterior inferior cerebellar artery); and wave IV-V, 
originating in the vicinity of the lateral lemniscus at 
the inferior colliculus (superior cerebellar artery). 


There are occasional anomalies in the arterial dis- 
tribution of the brain stem because of developmental 
variations and collateralization between the branch- 
es of the basilar artery. On the basis of these factors, 
occlusive disease may display a wide spectrum of 
symptoms and inconsistent evidence of neurologic 
disease.! 


A parallel to the use of ABR is ENG, which iden- 
tifies specific parameters of the vestibular system. 
These include ocular tracking, spontaneity and type 
of nystagmus, positional components, visual suppres- 
sion, and calorically induced responsiveness. The 
data from ENG are purported to separate peripheral 
areas from central areas causing vertigo; this dichot- 
omy fails in the scenario of multiple vascular lesions 
that may simultaneously affect the inner ear and brain 
stem. 


Arecent report by Stoddart et al?6 confirms the lim- 
ited value of MRI toward identification of structural 
injury when the ENG indicates a central source of 
vertigo. They concluded that functional disturbances 
can be the underlying mechanism causing vertigo 
without structural damage. 


We examined each of the subjects in the study by 
ENG. The results were essentially normal or mildly 
abnormal and consistent with age, such as minor low- 
amplitude saccadic activity (under 2°) or delayed or 
diminished visual tracking. There was no evidence 
of spontaneous or positionally induced nystagmus 
in this cohort. Finally, we concluded that the caloric 
tests added little positive diagnostic information to- 
ward specific localization of the lesion. 


We have examined the correlation of MRI and 
MRA and ENG for localization of the symptom-pro- 
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ducing lesions. On MRI, approximately 10% of the 
subjects demonstrated small areas of infarction in the 
brain (3/52) or cerebellum (2/52); there was no evi- 
dence of significant medullary or pontine disease in 
any of the subjects. This finding is in accordance with 
the physical examination, which identified unim- 
paired brain stem function. We conclude that MRI 
has limited diagnostic value in the evaluation of iso- 
lated vertigo. Whatever evidence of cerebral damage 
was observed was considered to be of unknown du- 
ration and could not necessarily be correlated with 
the symptom of vertigo. Examination by MRA of the 
cervical and intracerebral arterial channels revealed 
7 significant abnormalities. These findings included 
occlusion or stenosis and were recorded in the verte- 
bral arteries (4 subjects), the basilar artery (2 sub- 
jects), and the internal carotid artery (1 subject). 


In summary, our data suggest that in a minority of 
subjects, the ABR can contribute to the evaluation 





of vertiginous episodes. The abnormal results tend 
to affirm the diagnosis of central vertigo as opposed 
to vertigo with a peripheral source when the early 
waveforms (I and II) are intact and the following 
waves are absent, the IPL is prolonged, or later wave 
morphologies and amplitudes are distorted. Finally, 
when positive data are evident, this information can 
be integrated into the neuro-otologic battery during 
the investigation of vertigo and result in additional 
localization of the source of the vestibular symptom. 
The various abnormal ABR patterns suggest multiple 
defective areas within the brain stem that interfere 
with the electrical transmission associated with audi- 
tory responses. Bilaterally abnormal patterns imply 
even more extensive dysfunctional areas. The obser- 
vation that the medulla, pons, and midbrain were nor- 
mal in the preponderance of individuals when viewed 
by MRI implies either regional physiologic functional 
blockades or neuronal tract injuries below radio- 
graphic resolution. 
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VASCULAR PERMEABILITY TO FLUORESCENT SUBSTANCE IN 
HUMAN CRANIAL NERVES 
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It has recently been reported that in the facial canal, the facial nerve shows enhancement on gadolinium-enhanced magnetic 
resonance imaging in patients with clinically normal facial nerves. However, the mechanism of this enhancement has not yet been 
sufficiently clarified. The present study investigated the permeability of blood vessels in human cranial nerves that were obtained 
from surgically treated patients. The patients received an intravenous injection of sodium fluorescein 45 minutes before nerve resec- 
tion. For histologic observation, the nerves were removed and frozen at —70°C, and the sections were then cut at 4-um thickness with 
a freezing microtome. The localization of the tracers was examined with a fluorescence microscope. Fluorescence was observed in 
the external nerve sheath and slightly in the endoneurium of these nerves, but was not observed within nerve fibers. These findings 
indicate that the vascular barrier in human peripheral nerves is incomplete. 


KEY WORDS — cranial nerve, fluorescein, gadolinium-enhanced magnetic resonance imaging, vascular permeability. 


INTRODUCTION 


It has been reported recently that the facial nerve 
in the facial canal shows enhancement on gadolini- 
um-enhanced magnetic resonance imaging (Gd-MRI) 
in patients with clinically normal facial nerves.!? En- 
hancement has also been observed in the intracaver- 
nous segment of the third to sixth cranial nerves, the 
gasserian ganglion, and the second and third divi- 
sions of the trigeminal nerve at the base of the skull.* 
The mechanism of these enhancements has not been 
sufficiently clarified. 

We have previously reported permeability of the 
blood-nerve barrier in rabbit cranial nerves by so- 
dium fluorescein,* and reported a distribution of fluo- 
rescein similar to that found in human cranial nerve 
regions showing gadolinium-diethylene-triamine- 
penta-acetic acid (Gd-DTPA) enhancement. The 
present study was performed to investigate the per- 
meability of blood vessels in the human cranial nerves. 


MATERIALS AND METHODS 


Human nerve specimens were obtained from pa- 
tients during head and neck surgery. A segment of 
the facial nerve was obtained during total parotidecto- 
my from a patient with parotid carcinoma who showed 
no facial nerve paralysis before operation. Segments 
of the hypoglossal nerve and the lingual nerve were 
obtained during hemiglossectomy, and a segment of 
the superior laryngeal nerve was obtained during to- 


tal laryngectomy. The patients received an intrave- 
nous injection of sodium fluorescein (Fluorescite, 5 
mL) about 45 minutes before the nerve resection. 
Informed consent was obtained from each patient. 


The nerve samples were embedded into resin with- 
out fixation, preserved at —70°C until use, sliced with 
a microtome at 4-um thickness, and observed under 
a fluorescence microscope. 


RESULTS 


In the facial nerve, intense fluorescence was ob- 
served in the external nerve sheath. A slight fluores- 
cence was also seen in the endoneurium, but no fluo- 
rescence was observed within the nerve fibers (see 
Figure). These findings were seen in the hypoglos- 
sal, lingual, and superior laryngeal nerves, as well. 


DISCUSSION 


The central nervous system contains the blood- 
brain barrier and the blood—cerebrospinal fluid bar- 
rier, selective barriers to the passage of materials from 
the blood to the brain and the spinal cord. The periph- 
eral nerves have a similar blood-nerve barrier. How- 
ever, numerous studies have indicated the absence 
of vascular barriers in the peripheral ganglia and ex- 
ternal nerve sheath, and that the barrier function of 
the endoneurial vessels is incomplete.> 


We previously examined the permeability of rab- 
bit cranial nerves using fluorescein. On the basis of 
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Fluorescence micrographs of human facial nerve after injection of sodium fluorescein (original x40). A) Longitudinal section. 
Intense fluorescence is visible in external nerve sheath, and weak fluorescence in endoneurium. B) Cross section. Fluores- 
cence is visible in external nerve sheath and endoneurium, but not within nerve fibers. 


our observations, it was hypothesized that the physi- 
ological enhancement of human cranial nerves seen 
on Gd-MRI is caused by this vascular permeability. 
However, there has been no research on the perme- 
ability of the blood vessels in the human peripheral 
nerves. 


In this study, we examined for the first time the 
vascular permeability of the human cranial nerves by 
using sodium fluorescein. This tracer has been used 
clinically for angiography of the ocular fundus. Its 
high solubility in water allows rapid diffusion into 
extracellular spaces, except in special regions consti- 
tuting a blood-brain barrier or a blood-retinal barrier.6 


The fluorescence intensity in the human periph- 
eral nerves such as the facial, hypoglossal, lingual, 
and superior laryngeal nerves was strong in the ex- 
ternal nerve sheath and weak in the endoneurium. 
These findings are identical to those in the rabbit, 
and seem to indicate that the barrier of the blood ves- 
sels in the human peripheral nerve is also imperfect. 


A paramagnetic agent, Gd-DTPA decreases the 
spin lattice relaxation time in tissues in which it ac- 
cumulates, resulting in improved contrast enhance- 
ment in MRI. Sodium fluorescein has a molecular 


weight of 376 and a negative charge, while Gd-DTPA 
has a molecular weight of 742 and is also negatively 
charged. Their similar distribution in the human cra- 
nial nerves offers a possible examination for the Gd- 
MRI enhancement observed in normal cranial nerves. 


The present study suggests that the normal facial 
nerve and trigeminal nerve can be outlined by con- 
trast MRI using Gd-DTPA because of the relatively 
ineffective barrier of the blood vessels of these cra- 
nial nerves for the contrast material. 


CONCLUSION 

The vascular permeability of human peripheral 
nerves such as the facial, hypoglossal, and lingual 
nerves was examined with a fluorescent tracer. In all 
of the nerves, strong fluorescence intensity was ob- 
served in the external nerve sheath, and the fluores- 
cence was weak in the endoneurium. These findings 
were identical to those in the rabbit, and seem to in- 
dicate that the barrier of the blood vessels in the hu- 
man peripheral nerves is also imperfect. It is hypoth- 
esized that the enhancement of the normal facial nerve 
and trigeminal nerve seen on Gd-MRI is caused by 
this vascular permeability of the blood vessels of these 
nerves. 


REFERENCES 


1. Gebarski SS, Telian SA, Niparko JK. Enhancement along 
the normal facial nerve in the facial canal: MR imaging and 
anatomic correlation. Radiology 1992:183:391-4. 

2. Schwaber MK, Larson TC III, Zealear DL, Creasy J. Gad- 
olinium-enhanced magnetic resonance imaging in Bell’s palsy. 
Laryngoscope 1990;100:1264-9. 

3. Kilgore DP, Breger RK, Daniels DL, Pojunas KW, Wil- 
liams AL, Haughton VM. Cranial tissues: normal MR appear- 


ance after intravenous injection of Gd-DTPA. Radiology 1986; 
160:757-61. 


4. Nakao Y, Sakihama N, Matsumoto K, Ochi M, Kumagami 
H. Vascular permeability to sodium fluorescein in the rabbit 
cranial nerve root: possible correlation with normal cranial nerve 
enhancement on gadolinium-enhanced magnetic resonance im- 
aging. Eur Arch Otorhinolaryngol 1994;251:457-60. 


5. Olsson Y. Microenvironment of the peripheral nervous sys- 
tem under normal and pathological conditions. Crit Rev Neuro- 
biol 1990;5:265-311. 


6. Aaborg TM. Principles and practice of ophthalmology. 
Philadelphia, Pa: Saunders, 1980:905-29. 











Ann Otol Rhinol Laryngol 111:2002 





ACOUSTIC ANALYSIS OF UPPER AIRWAY OBSTRUCTION IN THE 
EXCISED HUMAN LARYNX 
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Upper airway obstruction is an emergency that requires quick and decisive intervention. Stridor is the sound created by airflow 
through a partially obstructed airwzy, and has been described to vary with the site and degree of obstruction. This study sought to 
determine the sound characteristics of stridor in the excised human larynx. Five fresh cadaver human larynges were harvested and: 
subjected to obstructions at supraglottic, glottic, and subglottic subsites. Subglottic pressure, airflow, and audio signal were re- 
corded. Data were analyzed on the basis of laryngeal obstruction subsite and the degree of laryngeal resistance. Visual inspection 
demonstrated certain trends in peak spectral energy depending on the site and, more significantly, the amount of obstruction. Statis- 
tical analysis of spectral waveforms showed better correlation with the amount of obstruction than with the site of obstruction. In 
summary, the frequency distribution of stridor produced in an excised human larynx was influenced by the amount of laryngeal 


resistance, but not by the site of airway obstruction. 


KEY WORDS — acoustics, airway obstruction, Fourier analysis, stridor. 


INTRODUCTION 


Upper airway obstruction is an emergency that re- 
quires quick and decisive intervention. Stridor, the 
harsh sound created by airflow through an abnormal, 
partially obstructed airway, is a sign that often ac- 
companies upper airway discrders. The sound may 
represent a self-resolving, temporary condition, or a 
progressive, life-threatening situation. The stridor 
quality varies with the level of obstruction, and an 
experienced clinician can use auscultation in an at- 
tempt to determine the site and degree of obstruc- 
tion. This method, however, is highly subjective. A 
second technique, clearly mors invasive, involves en- 
doscopy. Direct laryngobronchoscopy may be used 
to explore the airway disorder, yet this procedure it- 
self may occasionally worsen the obstruction through 
the necessary trauma of instrumentation, especially 
in the narrow airways of an infant. 


The fact that auscultation may be used by a vet- 
eran clinician to diagnose the obstruction denotes that 
stridor, and its acoustic character, hold diagnostic in- 
formation. The subjective atility to analyze stridor 
requires significant experienze and familiarity with 
airway disorders. Attempts to more objectively ana- 
lyze the stridor frequently involve acoustic analysis. 
This technique has evolved over time from the ini- 
tial complicated analog spectrographic analysis to 
more sophisticated digital acoustic analysis. 


The fast Fourier transform (FFT) is a mathemati- 


cal method of expressing data in terms of discrete 
frequency and amplitude. The FFT has been estab- 
lished in the analysis of biomedical data, and lends 
itself well to acoustic research.! The FFT has been 
used in several studies of stridor to determine the 
utility of acoustic analysis in the clinical setting.26 
However, these studies are limited in the lack of stan- 
dardization of disease states. Coleman and Schechter? 
performed the FFT on data collected in an airway 
model consisting of rigid plastic pipe. Although re- 
producibility was good, the model lacked the char- 
acteristics of a biological tissue system. 


Since the time of Leonardo da Vinci, science has 
used excised human larynges for phonation research.8 
Human larynges, however, have not been used fre- 
quently in airway obstruction research. This model 
has both clear benefits and weaknesses. The tissue, 
although cadaveric, more closely resembles the in 
vivo condition than the tubing model of Coleman 
and Schechter.’ Also, reproducibility is improved as 
the excised cadaveric larynx allows for the creation 
of more precise obstructions, although limited by the 
natural variance in size of human larynges. 


Stridor is similar to phonation, in that each has a 
sound source and post-source modification. In the in 
vivo patient models, the stridor does involve post- 
source modification. In our study, however, the iso- 
lated larynx provides a source signal devoid of modu- 
lation. This serves to reduce the degree of variability 
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TABLE 1. CADAVER DEMOGRAPHICS 
Cause of Death 








Larynx Age (y) Sex 





1 79 M Malignant mesothelioma 

2 67 M Aortic aneurysm 

3 71 F Emphysema 

4 85 F Chronic obstructive pulmonary disease 
5 83 M Respiratory failure 





introduced by the size and variable configuration of 
the patient’s vocal tract. 


This study seeks to determine whether character- 
istic acoustic patterns can be demonstrated in upper 
airway obstructions in fresh excised human cadaver 
larynges. Both the location and severity of obstruc- 
tion were hypothesized to influence the acoustic spec- 
tra. The results of experimentation with excised hu- 
man larynges may then be compared to those of pre- 
viously published research using other model sys- 
tems. 


MATERIALS AND METHODS 


Five human larynges were harvested from nonpre- 
served cadavers obtained via the Anatomic Program 
of Virginia and the Anatomic Pathology Department 
of Eastern Virginia Medical School. The demograph- 
ics of the cadavers are listed in Table 1. The larynges 
were removed from the cadavers, including tissue 
from the level of the thyroid cartilage at the superior 
extent to a point approximately 8 cm below the cri- 
coid cartilage at the inferior extent. The esophagus 
was harvested along with the larynx and tracheal por- 
tion in order to avoid damage to the tracheoesoph- 
ageal common wall. Cervical soft tissues superficial 
to the intrinsic laryngeal musculature were removed. 
The larynges were grossly inspected for evidence of 
disease. (A sixth larynx was excluded because of ob- 
vious endotracheal tube trauma.) The larynges were 
refrigerated in buffered saline solution at 8°C (48°F) 
if they were to be used within 72 hours of harvest; 
otherwise, they were frozen for as long as 2 weeks. 
The frozen larynges were thawed at refrigerated tem- 
peratures. 


A small cutting bur was used to create bilateral 
holes in the thyroid cartilage at the level of the true 
and false vocal folds. Dissection was carried out with 
a blunt probe through these holes along the true and 
false vocal folds, creating a tract for placement of a 
balloon catheter. Two 4-F Fogarty embolectomy cath- 
eters (model 120804F, Baxter Healthcare Corpora- 
tion, Irvine, California) could then be placed through 
the thyroid cartilage into both true or false vocal folds 
to produce either a variable glottic obstruction or a 
variable supraglottic obstruction. 


The fresh larynges were suspended in a metal frame 


MICROPHONE DAT RECORDER 











BALLOON 
CATHETERS 


FLOWMETER SUCTION 


Fig 1. Schematic representation of experimental configu- 
ration. 


by heavy silk sutures placed through the thyroid car- 
tilage at the superior extent and around the last har- 
vested tracheal ring at the inferior extent. Figure | 
represents the experimental setup. A 7-0 cuffed en- 
dotracheal tube with a trimmed tip was inserted ret- 
rograde into the trachea and positioned below the cri- 
coid cartilage. This endotracheal tube was coupled 
to a vacuum source via a circuit of semirigid vinyl 
tubing, which included both a ball flowmeter and a 
venting valve to allow for measurement and fine- 
tuning of the airflow. Vacuum-generated airflow was 
felt to be more physiologic than airflow from a posi- 
tive pressure source (eg, compressed air) in this mod- 
el, because most stridor is inspiratory. An |8-gauge 
needle was introduced into the subglottis via the tra- 
cheal wall. This was connected to a pressure trans- 
ducer (model PX 186, Omega Technologies Company, 
Stamford, Connecticut) via arterial line tubing. Sub- 
glottic pressure was read with a digital voltmeter 
(Digital Multimeter model 310, Beckman Instrument, 
Inc, Fullerton, California) calibrated at 20 cm H20. 


The suspended larynx was placed in a sound- 
treated booth (Industrial Acoustics Company, Inc, 
Bronx, New York) with the vacuum source placed 
outside the booth in order to reduce environmental 
and confounding sound. Airflow was initiated with 
the vacuum source, and adjustments were made in 
flow and pressure. Obstruction at the glottis (true vo- 
cal folds) or supraglottic area (false vocal folds) was 
created by inflation of the embolectomy catheters. 
This obstruction could be varied by the amount of 
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Fig 2. Schematic diagram shows location of peaks in 
spectral waveforms for 3 sites of laryngeal obstruction. 
Number in, or near, box and relative height of box show 
number of waveforms with peak at that location. 


balloon inflation, as well as the use of interarytenoid 
stents. Similar experimental configurations were used 
by Yanagi et al? in the study of phonation in excised 
canine larynges. The obstruction was varied until a 
change in sound quality was appreciated. Acoustic 
data were recorded on a digital audiotape recorder 
(model TCD-D3, Sony Corporation, Tokyo, Japan) 
with a microphone (model 33-3003, Radio Shack, 
Fort Worth, Texas) positioned 2 cm above the glot- 
tis. The subglottic pressure and airflow measurements 
were recorded at each degree of obstruction. 


After data had been collected for the supraglottic 
and glottic obstructions, the larynx was removed from 
the suspension frame. Latex caulking compound was 
injected via the lumen of the trachea into the sub- 
glottic region with a Bruening Teflon syringe and an 
18-gauge Arnold needle. The injections were carried 
out circumferentially to create a stenosis of approxi- 
mately 50%. The larynx was then resuspended, and 
recordings of sound, pressure, and flow were made. 
The tip of the pressure transducer needle was posi- 
tioned below the stenosis. The obstruction was in- 
creased further via external subglottic compression 
of the trachea. 


Acoustic data recorded on digital audiotape were 
then sampled at 44.1 kHz and imported via a 16-bit 
sound card (Sound Blaster 16, Creative Labs, Inc, 
Milpitas, California) onto a personal computer. Cool 
Edit Pro (Syntrillium Software Corporation, Phoe- 
nix, Arizona) was used to sample the data, and Lab- 
view 4.1 (National Instruments Corporation, Austin, 
Texas) performed FFT analysis. The FFT peaks were 
noted both in number and frequency. Bandwidth was 
calculated as the range of frequencies encompassed 
between the shoulders of the peak. 


The airflow and subglottic pressure measured dur- 
ing the sound recordings were then used to calculate 
the laryngeal resistance. This was done with a modi- 
fied Ohm’s law equation where: Resistance (cm H20 
e s/L) = Pressure (cm H20)/Airflow (L/s). 


RESULTS 


The results of the FFT analyses were graphed with 
frequency versus magnitude. A separate graph was 
made for each larynx at various flow rates and de- 
grees of obstruction (ie, resistances). The flow rates 
generated through the larynges varied from 12 to 40 
L/min, and the subglottic pressures varied from —1 
to —92 cm H20 relative to atmospheric pressure. The 
resultant laryngeal resistance ranged from less than 
1 cm H20 « s/L to 319 cm H20 « s/L. 


Thirty-four experimental trials involved various 
amounts of glottic obstruction. Twenty-one of the 
graphs (62%) demonstrated a frequency peak in the 
low frequencies from 375 to 881 Hz (mean, 586 Hz). 
These peaks tended to be relatively broad, with a mean 
bandwidth of 906 Hz. Twenty-one of the experimen- 
tal trials (62%) also showed a peak between 4,000 
and 7,900 Hz (mean, 5,174 Hz). This second peak 
tended to be broader, with a mean bandwidth of 2,264 
Hz. Six graphs showed similar broad peaks in the 
range of 2,960 to 4,000 Hz (mean, 3,305 Hz). Eight 
trials demonstrated intermediate peaks between 1,200 
and 1,850 Hz (mean, 1,522 Hz). Eleven trials had a 
single peak within | of the above ranges. The remain- 
der had peaks in 2 of the groups. These findings are 
summarized in Fig 2. 


Twenty-six experimental runs were made on the 
larynges with supraglottic obstructions. Sixteen 
graphs (62%) demonstrated a low-frequency peak be- 
tween 375 and 780 Hz (mean, 536 Hz). Eighteen 
(69%) also showed a second peak between 4,000 and 
7,000 Hz (mean, 5,000 Hz). Five trials had second- 
ary peaks between 2,400 and 4,000 Hz (mean, 3,121 
Hz), and 9 had peaks between 1,000 and 2,000 Hz 
(mean, 1.332 Hz), all with a higher peak in the above- 
described ranges, as seen in Fig 2. 


The subglottic obstruction group contained 27 tri- 
als. Twenty-three of the trials (85%) had a distinctive 
peak between 4,000 and 5,500 Hz (mean, 4,414 Hz). 
Similar to the other obstruction sites, 16 (60%) dem- 
onstrated peaks between 375 and 800 Hz (mean, 552 
Hz). Eight trials showed mid-frequency peaks be- 
tween 1,200 and 2,000 Hz (mean, 1,452 Hz). The 
locations of these peaks are shown schematically in 
Fig 2. 

If all trials, regardless of site of obstruction, are 
compared as a single group, patterns are visually ap- 
parent. Systems with laryngeal resistance of less than 
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Fig 3. Glottic obstruction in larynx 1. A) With laryngeal resistance of 5.0 cm H20 + s/L (peaks, 1,389 and 5,594 Hz). B) With 
laryngeal resistance of 83.92 cm H20 « s/L (peaks, 630 and 4,732 Hz). 


20 cm H20 ° s/L (N = 24) had peaks between 1,000 
and 1,900 Hz in 17 of 24 instances (71%); Fig 3A is 
a representative FFT graph. As laryngeal resistance 
increased above 20 cm H20 « s/L.(N = 53), the peak 
between 1,000 and 1,900 Hz, when present, disap- 
peared in 86%. A low-frequency peak appeared be- 
tween 375 and 800 Hz (when it had not been present 
at lower resistance) in 74% of trials; Fig 3B repre- 
sents this FFT pattern. With a further increase of la- 
ryngeal resistance above 20 cm H20 e s/L, or in- 
crease of laryngeal airflow, no additional trends could 
be demonstrated in the location of frequency peaks, 
the addition of secondary peaks, or the increase in 
baseline frequency (ie, decreased signal-to-noise ra- 
tio). 


Correlation analysis was performed on the wave- 
forms generated by FFT. For each anatomic. subsite, 
the correlation between waveforms within the ana- 
tomic group was determined and the mean correla- 
tion was calculated (mean of all possible pairs of 
spectra). Similar cross-correlations were also calcu- 
lated between anatomic groups; eg, the waveforms 
for glottic obstruction were correlated with subglot- 


‘tic obstruction, subglottic with supraglottic, etc. The 


means were again calculated. The results are shown 
in Table 2. 


By use of these average correlations, it is then pos- 
sible to determine whether the average correlation 
within a given anatomic group is greater than the cor- 
relation between it and another anatomic group. If 
so, this would imply that the waveforms in that sub- 
site of obstruction have a characteristic shape that 


TABLE 2. AVERAGE CORRELATION BETWEEN 
WAVEFORMS BY OBSTRUCTION SITE 


Site of Supraglottic Glottic Subglottic 
Obstruction (N = 26) (N = 34) (N = 27) 
Supraglottic 0.908+0.050 0.915+40.045 0.913 40.047 
Glottic 0.93140.039 0.926 + 0.042 
Subglottic 0.928 + 0.040 

Data are mean + SD. 














differs from the waveforms in another subsite. 


The average correlation of the glottic waveforms 
with themselves was statistically greater (p < .0001) 
with the Student t-test than was the correlation with 
either subglottic or supraglottic obstruction. For sub- 
glottic obstruction, the average correlation within the 
group was greater (p < .0001) than the correlation 
with the supraglottic group, but it was not different 
from that with the glottic group (p = .2). For supra- 
glottic obstruction, the difference between the corre- 
lation within its group and the correlation with glot- 
tic obstruction was significant (p < .0001), but the 
difference between the within-group correlation and 
the correlation with the subglottic group was not sig- 
nificant (p = .2). 


A similar set of calculations was performed in 
which waveforms were grouped in terms of the laryn- 
geal resistance present during data collection, but in- 
dependent of obstruction site. Categories of airflow 
resistance were created: low, or 0 to 20 cm H20 ° s/L; 
medium, or 20 to 50 cm H20 ° s/L; and high, or >50 
cm H20 « s/L. These results are shown in Table 3. 


As with the groups based on site of obstruction, 
the average correlations within each of these groups 
were compared with the averages when correlated 
with the other airflow resistance groups. The aver- 
age correlation within an airflow resistance group 
was significantly higher (p value much less than .05) 
than the correlation between that group and one of 
the other resistance groups. The only exception to this 
finding was the comparison between high-high re- 
sistance and medium-high resistance. No clear rea- 


TABLE 3. AVERAGE CORRELATION BETWEEN 
WAVEFORMS BY AIRWAY RESISTANCE 














Airway Low Medium High 
Resistance (N = 23) (N = 27) (N = 37) 
Low 0.938 + 0.034 0.914+0.041 0.913 + 0.047 
Medium 0.927 + 0.038 0.923 + 0.045 
High 0.918 + 0.049 


Data are mean + SD. 
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son for this finding is evident, but there was greater 
variability within the high-resistance data, which 
could have lowered the within-group correlation val- 
ue. 


DISCUSSION 


Practicing otolaryngologists use auscultation as an 
important portion of patient evaluation and diagno- 
sis. The hyponasality of nasal obstruction and the 
“hot potato” voice of a peritonsillar abscess are fa- 
miliar clues to the level of the zirway disorder. Along 
this line, some clinicians draw conclusions regard- 
ing the level of laryngeal disorders based on the qual- 
ity and respiratory phase components of stridor. Stri- 
dor, derived from the Latin, means creaking or harsh 
sound. The presence of stridor during respiration usu- 
ally indicates airway obstruction in the larynx or prox- 


_ imal trachea. Cotton and Reilly!° classified stridor - 


as supralaryngeal, supraglottic, glottic, subglottic, or 
tracheal. Stridor may also be classified by the phase 
of respiration in which it is present: expiratory, in- 
spiratory, or biphasic. The quality of the stridor, its 
severity, and its improvement may yield information 
regarding the severity of the obstruction, the possi- 
bility of impending airway collapse, and the response 
to treatment. However, these are very subjective ob- 
servations. 


Researchers have investigated noninvasive tech- 
niques for evaluating the location and degree of air- 
way obstruction. Flow-volume2 curves can be useful 
in assessing intrathoracic versus extrathoracic ob- 
struction. Classically, the variable upper airway ob- 
struction demonstrates flattering of the inspiratory 
loop, with less effect on the expiratory phase. The 
acoustic reflection technique has been used for as- 
sessment of the airway dimz2nsions and obstruc- 
tions.1!12 High-frequency. sound is introduced into 
the airways, and a microphone records the reflected 
sound. Signal processing allows for the calculation 
of airway cross-sectional area. 


Acoustic analysis provides a method of investi- 
gating sound data and numerically and graphically 
representing the information. One of the earliest stud- 
ies to use acoustic analysis to evaluate airway sounds 
was by Hirschberg!3 in 1980. Ey analog spectral anal- 
ysis, types of stridor were characterized by their sound 
variation over time, and to a lesser extent by their 
constituent frequencies. Pharyngeal stridor was noise- 
like, with frequent breaks; laryngeal stridor was more 
“musical” in nature, with overtones; subglottic and/ 
or tracheal stridor had a hollow timbre; and bron- 
chial stridor had a noiselike quality and intense fre- 
quencies at 2 kHz. 


The FFT is a mathematical technique for analysis 
of signals in the frequency domain. It represents the 


frequency density of an acoustic signal.! Gray et al? 
published preliminary data from use of this technique 
to analyze 3 infants with stridor. They found low- 
frequency peaks at about 500 Hz in pathological air- 
ways, which resolved with improvement of the ob- 
struction in 2 patients. Laryngomalacia demonstrated 
a single broad peak at 500 Hz, whereas subglottic 
lesions demonstrated either a single peak at 500 Hz 
or double peaks at 500 and 900 Hz. Rao et al,> in 
1990, used FFT to study tracheostomy tube obstruc- 
tion in 10 adult patients. This study found that the 
baseline frequency at less than 300 Hz shifted in oc- 
cluded tubes to 1,000 to 1,600 Hz. Although this study 
looked strictly at tracheostomy tubes, parallels could 
be drawn to natural anatomic airway obstructions. 
Coleman and Schechter’ investigated this issue in 
rigid plastic pipe models of the airway. They found a 
pattern of increasing high-frequency signals (>3 kHz) 
as obstruction increased beyond 60% to 80%. They 
concluded that the obstruction site’s effect on signal 
frequency was mostly dependent on natural resonance 
frequencies based on tube length. 


Leiberman et al’ further investigated this idea us- 
ing a technique known as power spectral density. This 
technique is similar in concept to FFT, yielding the 
relative signal strength of each frequency, but taking 
into account the variability of the signal over time, 
especially with inspiration and expiration. Only ex- 
ample cases of obstruction were presented, and low 
frequencies predominated in the airway obstructions, 
as in the studies using FFT. 


Our study was an attempt to analyze the acoustic 
characteristics of laryngeal obstructions of various 
degrees and locations. The excised fresh human ca- 
daver larynx was chosen because it was more physi- 
ologic than previously studied tube models,’ yet more 
reproducible than the limited number of published 
patient series. These latter series are plagued by sig- 
nificant and unpredictable differences in anatomic 
level of obstruction and severity. 


The FFT analysis of the data obtained reveals the 
consistent appearance of low-frequency (300 to 800 
Hz) signals with obstruction. Nonobstructed “con- 
trol” and low-grade—obstructed larynges (resistance 
of less than 20 cm H20 e s/L) demonstrated peaks at 
1 to 2 kHz in addition to high-frequency signals at 4 
to 7 kHz. This 1- to 2-kHz peak was less frequent as 
laryngeal resistance rose above 20 cm H20 « s/L. It 
was at this same point that the 300- to 800-Hz peaks 
appeared. 


On visual inspection, there was significant varia- 
tion both within and between larynges, even at the 


_ same site of obstruction. Factors that can be used to 


explain this variation include structural variations 
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within normal larynges and anatomic laryngeal dif- 
ferences between genders. To a lesser extent, some 
degree of tissue desiccation despite preventive ef- 
forts may have contributed. Larger numbers of la- 
rynges would allow for the control of some of these 
factors and increase the power of any differences seen 
in the data. 


Statistical analysis of waveforms showed less vari- 
ation than could be appreciated with visual inspec- 
tion. When the waveforms were categorized in terms 
of anatomic site of obstruction or amount of airflow 
resistance, the degree of correlation was actually 
quite high (at least 0.911). These results were some- 
what surprising, especially in the case of obstruction 
categories. As mentioned above, certain characteris- 
tic patterns were seen at each site, but the findings 
were far from uniform or distinctive. 


In agreement with our visual inspection, categori- 
zation based on the degree of obstruction (and thus 
resistance) produced more distinct groups than did 
categorization based on the site of obstruction. In all 
but a single case, the average correlation within an 
airway resistance group was statistically higher than 
the average correlation with the other groups. 


No other studies found in the English-language 
literature made use of nonpreserved excised human 
cadaver larynges for investigation of airway obstruc- 
tion. Several studies have investigated phonation ex 
vivo with human and canine larynges. Our work 
shows that the excised larynx can be used for airway 
obstruction research, and acoustic results can be ob- 
tained by use of vacuum systems to generate airflow. 


The results of this study do vary from the pub- 
lished patient studies, as would be expected with the 
isolated larynx model used. Similar low-frequency 
peaks were seen in all sites of obstruction, similar to 
the previously discussed series, but no significant dif- 
ference was seen in frequency distribution between 
levels of obstruction. The site of laryngeal obstruc- 
tion did not appear to affect the FFT spectra as much 
as the presence of increased laryngeal resistance. The 
inflection point of 20 cm H20 e s/L in our study is 
consistent with other research that has shown nor- 





mal laryngeal resistance to be 1 cm H20 « s/L and 
symptomatic obstruction to begin at upwards of 15 
cm H20 e s/L.!* Unpublished data from the senior 
author (R.L.P.) indicate that vocal tract formants, de- 
termined by vocal tract filtering, seem to play more 
of a role in the acoustic characteristics of stridor than 
does the exact sound source. 


The goal of this study was to determine the acous- 
tics of the actual source signal in airway obstruction. 
Stridor production is similar to phonation in terms 
of basic acoustic theory. Both involve a source and 
post-source modification, or filtering. The source-fil- 
ter theory is a linear model of speech production de- 
veloped in the 1950s. It describes voice production 
as involving a quasiperiodic pulsed source (vocal 
folds), varying in pitch and amplitude, and a linear 
filter- (vocal tract).!5:16 This theory may also describe 
stridor production in the in vivo situation. The stri- 
dor source is presumed to be the tissue at the level of 
obstruction, with the vocal tract providing modula- 
tion of the signal. Most research has investigated the 
signal after modification by the vocal tract. Our mod- 
el of the excised larynx allows for examination of 
the stridor source without the variable vocal tract fil- 
tering. Thus, the significant modification of sound 
that occurs in the upper airways (supraglottis, phar- 
ynx, nose, and mouth) in living patients is avoided. 


CONCLUSION 


Stridor is a pathological noise created by airway 
obstruction. Various noninvasive techniques have 
been used to investigate this phenomenon, includ- 
ing acoustic analysis. This study attempted to use a 
reproducible, yet more physiologic, model for this 
investigation of the isolated stridor source. Changes 
in anatomic site of obstruction did not produce visu- 
ally recognizable distinct differences in the frequency 
distribution of the resulting stridor. However, changes 
were seen in the frequency curves as laryngeal air- 
way resistance increased from normal values to lev- 
els of pathological airway obstruction. Further re- 
search with larger numbers of larynges is needed to 
investigate the role of acoustic differences in vari- 
able obstruction, as well as the role of FFT in this 
model of research. 
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CISPLATIN-INDUCED OTOTOXICITY AND PHARMACOKINETICS: 
PRELIMINARY FINDINGS IN A DOG MODEL 
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The early effects ofa clinical dose of cisplatin (100 mg/m?) on distortion-product otoacoustic emissions (DPOAE) thresholds and 
the relationship between DPOAE threshold shifts and changes in plasma concentrations of filterable and total platinum (Pt) follow- 
ing infusion of cisplatin in a dog model were investigated. The DPOAE thresholds (based on input-output function) were measured 
2 days before a single high dose of cisplatin administration, and compared with measurements recorded 2 and 4 days after infusion. 
The results revealed DPOAE thresholds to be elevated by 4 days after the administration of cisplatin. However, this elevation could 
not be correlated with plasma concentrations of filterable and total Pt, which showed little variation over the 48-hour postinfusion 
period between animals. The present study demonstrated that DPOAE thresholds have the potential to be used as an indicator of 
cisplatin-induced ototoxicity, and cisplatin-induced ototoxicity could not be explained by plasma Pt kinetics in individual animals. 


KEY WORDS — cisplatin, distortion-product otoacoustic emissions, otoacoustic emissions, ototoxicity, pharmacokinetics. 


INTRODUCTION 


Ototoxicity is a well-known side effect of the anti- 
neoplastic agent cisplatin. The site of ototoxicity is 
centered mainly in the outer hair cells of the co- 
chlea.12 Traditionally, pure tone audiometry has been 
used by clinicians to detect and monitor cisplatin- 
induced ototoxicity in humans.3 Recently, however, 
clinicians have advocated the use of otoacoustic emis- 
sions (OAEs) to measure cisplatin-induced ototoxic- 
ity, as they not only provide a quick and objective 
means of assessing cochlear function, but they can 
also identify ototoxicity earlier than conventional pure 
tone audiometry. There are reports of reduction of 
transient-evoked OAE (TEOAE) and distortion-prod- 
uct OAE (DPOAE) amplitude*? following cisplatin 
chemotherapy in cancer patients. 


Animal studies attempting to investigate the mech- 
anism(s) of cisplatin ototoxicity have used doses of 
cisplatin that are difficult to compare with those used 
clinically in humans. In addition, the mode of deliv- 
ery of cisplatin (subcutaneous, intramuscular, or in- 
traperitoneal) in most animal studies differs markedly 
from the way the drug is administered to humans (slow 
intravenous infusion). The extrapolation of findings 
from these animal studies to humans has, therefore, 
been difficult. Further, individual variability in sus- 
ceptibility to cisplatin-induced ototoxicity has been 
reported in animal experiments by some research- 
ers.10.11 Whereas studies relating plasma concentra- 
tions of total platinum (Pt) to ototoxic changes!! un- 





successfully attempted to explain this variability, there 
have been suggestions that it could possibly be ex- 
plained by interindividual differences in plasma con- 
centrations of filterable rather than total Pt.!2 So far, 
there has been just one report of pharmacokinetic data 
on filterable Pt (the pharmacologically active form of 
the drug) in a guinea pig model.!3 In the present study, 
a dog model was used to delineate the early effects of 
cisplatin on DPOAEs, and to investigate the relation- 
ship between cisplatin ototoxicity and the pharmaco- 
kinetics of Pt in individual animals. The dog has sev- 
eral advantages over small mammalian species (rats, 
guinea pigs, and gerbils) in that dogs receive routine 
cisplatin chemotherapy at levels similar to those ad- 
ministered to humans for the treatment of a variety 
of carcinomas,!4 and they are suitable for multiple 
blood samplings for plasma Pt analysis because of 
their relatively large blood volume. In addition, valid 
OAE responses!>:!6 have been recorded in them by 
means of commercially available equipment. 


The aims of this study were 1) to investigate the 
early effects of a single clinical dose of cisplatin (100 
mg/m?) on the DPOAE thresholds in dogs and 2) to 
investigate whether changes in DPOAE thresholds 
could be explained by changes in plasma concentra- 
tions of total and filterable Pt. 


MATERIALS AND METHODS 


Animals. Five mixed breed dogs obtained from the 
City Council pound were used (Table 1). Each dog 
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TABLE 1. DETAILS OF DOGS 
Dog Breed Sex Age(mo) Weight (kg) 











l Cattle dog cross M 9 17:8 
2 Bull terrier cross M 12 18 
3 Rottweiler cross M 6 19 
4 Dachshund cross F 8 16 
5 Bull terrier cross F 12 19.4 





was wormed with Drontal (each tablet contains 175 
mg praziquantel, 174.4 mg pyrantel embonate, and 
875 mg Febantel; Bayer Australia Ltd, Pymble, Aus- 
tralia) and vaccinated against canine distemper, hepa- 
titis, parvovirus, and parainfluenza virus infection 
(Canvac 4 in 1, Commonwealth Serum Laboratories, 
Brisbane). They were housed in individual cages in 
an air-conditioned room set at 22°C to 23°C through- 
out the study and fed commercially available dry and 
canned dog food. 


Audiological Measurements. Bilateral otoscopy, 
tympanometry, and auditory brain stem response 
(ABR) audiometry were performed before OAE mea- 
surements to rule out external and middle ear dis- 
ease and hearing dysfunction. Baseline ABR and OAE 
recordings were made 2 days before cisplatin admini- 
stration. Only dogs with 1) type A tympanograms,!7 
2) ABR thresholds equal to or less than 20 dB nor- 
mal hearing level, and 3) replicable OAE recordings 
were included in the study. The ABR and OAE re- 
cordings were repeated 2 days and 4 days after the 
administration of cisplatin, and the data were com- 
pared with baseline measurements. 


Otoscopy was performed with a handheld Welch 
Allyn otoscope (Welch Allyn, Skaneateles Falls, New 
York). Tympanometry was performed with a Zodiac 
901 middle ear analyzer (Madsen, Taastrup, Den- 
mark) with a 220-Hz probe tone. 


The dogs were fasted for 12 hours before the ABR 
and OAE recordings (and cisplatin infusion) to mini- 
mize nausea and vomiting. They were then premedi- 
cated with intravenous acepromazine maleate (0.05 
mg/kg; Promex 2, 2 mg/mL, Apex Laboratories Pty 
Ltd, St Marys, Australia) 15 minutes before induc- 
tion with thiopental sodium (12 mg/kg; Thiobab, 25 
mg/mL, Jurox Pty Ltd, Silverwater, Australia). Af- 
ter endotracheal intubation, the dogs were maintained 
on 1.5% halothane and oxygen, and placed on a heat- 
ing mat and covered with a blanket. The rectal body 
temperature was monitored and maintained at 0.2°C 
to 0.7°C below preanesthetic levels during the record- 
ings. 

Auditory brain stem response thresholds were mea- 
sured with a Medelec Sapphire system (Oxford In- 
struments, Old Woking, England). Parameters typi- 
cally used for human recordings were recorded in 


the dogs as per previously described procedures. 16-18 
We recorded DPOAEs bilaterally with a commercial- 
ly available ILO92 OAE analyzer (Otodynamics Ltd, 
Hartfield. England) connected to an IBM 486 DX 
computer. The DPOAEs (2F1-F2) were recorded 
twice without altering the probe position to ensure 
that the responses were repeatable. The DPOAE 
thresholds were measured with the input-output 
(growth) function at 2, 4, and 6 kHz to equilevel pri- 
maries (L1/L2: 55/55 dB sound pressure level [SPL]) 
over 28 seconds (ie, 3 sweeps). The ratio of F2 to F1 
was 1.22. The stimulus level of the primary tones 
was varied from 55 dB SPL to 25 dB SPL. The low- 
est level (of F2 in decibels SPL) at which a DPOAE 
could be recorded 3 dB above 2 SD of the mean noise 
floor at a particular frequency was taken as the 
DPOAE threshold at that frequency. 


In the present study, recordings were made on the 
second day after infusion in an attempt to capture 
the early ototoxic effects of cisplatin, as Komune et 
al!’ reported a complete loss of hearing in some guin- 
ea pigs on the second day after a single high dose of 
cisplatin (12.5 mg/kg administered intravenously). 
The same recordings were repeated on the fourth day 
after infusion to investigate any delayed ototoxic re- 
sponse shifts. Bilateral OAE recordings were made 
in each dog. An analysis of variance for repeated mea- 
sures was performed with SPSS software (version 
7.5 for Windows) to compare the DPOAE thresh- 
olds of the 10 ears recorded 2 days and 4 days after 
cisplatin infusion with those recorded 2 days before 
the infusion. A significance level of p < .05 was set a 
priori. 

Cisplatin Administration. Cisplatin (5 mg in 21 mL 
0.9% sodium chloride; Cytomix, Victorian Hospital 
Association Trading Company, Melbourne, Austra- 
lia) was infused intravenously at 100 mg/min? over 
20 minutes at a rate of 25 mL/kg per hour. The dog 
was hydrated with 0.9% saline solution at a rate of 
25 mL/kg per hour for 3 hours before cisplatin infu- 
sion and for | hour after the infusion. 


Blood Sampling. Blood samples (2.0 mL) were 
collected from the cephalic vein, which had been can- 
nulated with a Surflo (Terumo Corporation, Tokyo, 
Japan) intravenous catheter. These had been collected 
in heparinized syringes at 0, 5, 10, 20, 30, 45, and 60 
minutes and 2, 4, 6, 10, 24, 36, and 48 hours after 
the commencement of the cisplatin infusion. Samples 
taken after 4 hours after cisplatin infusion were col- 
lected from a jugular vein by venipuncture. The blood 
samples were immediately placed on ice to minimize 
cisplatin inactivation and were centrifuged (centri- 
fuge model 113, Sigma, Osterode an Harz, Germany) 
for 6 minutes at approximately 1,600g in a 4°C cold 
room. 
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Platinum Analysis. The total plasma cisplatin con- 
centration was determined on 0.1 mL plasma, and the 
filterable fraction (the pharmacologically active form 
of the drug) was measured by centrifuging 0.4 mL 
of plasma in ultrafiltration filters (Millipore Ultra- 
free 10000 NMWL, Bedford, Massachusetts) for 30 
minutes at approximately 1,600g at 4°C. All samples 
were stored at —70°C until assayed for Pt content. 
The plasma samples were diluted 10-fold in high- 
purity water with the addition of indium 100 pg/L as 
the internal standard and then vortexed to achieve 
solution homogeneity in readiness for Pt analysis. 
Inductively coupled plasma mass spectrometry (ICP- 
MS) based on a method reported by Taylor et al? 
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Fig 1. Plasma concentration levels of total (open circles) and 
filterable (filled circles) platinum (Pt) in dog 1 to dog 5 over 48 
hours following infusion of cisplatin 100 mg/m2. 


was used to determine Pt concentration. The instru- 
ment was a VG PlasmaQuad (VG Elemental, Wins- 
ford, England) in standard configuration with a Mein- 
hard nebulizer and a double-pass, water-cooled boro- 
silicate spray chamber. All ICP-MS measurements 
were determined in peak-jumping mode by selection 
of mass Pt (33.8% natural abundance) together with - 
indium (95.7% natural abundance) as the internal 
standard. An external calibration technique using ma- 
trix-matched standards coupled with internal stan- 
dardization was used for quantification. With the vol- 
umes of sample available, the minimum quantifiable 
Pt concentration was set to 25 ug/L in plasma and 
the plasma ultrafiltrate and to 0.1 ug/g dry mass of 


748 


DPOAE Threshold (dB SPL) 





w 
is) 


nm 
œ 














2 days after infusion 4 days after intusion 


before infusion 


Fig 2. Box plot shows distribution of distortion-product 
otoacoustic emission (DPOAE) thresholds at 6 kHz (L1/ 
L2 = 55/55 dB sound pressure level [SPL]) before infu- 
sion and 2 and 4 days after infusion of cisplatin. Lines at 
top and bottom of box (whiskers) represent largest and 
smallest values. Shaded area of rectangular box repre- 
sents interquartile range, and horizontal line in box rep- 
resents median. 


tissue (coefficient of variation. <5%). 


Pharmacokinetics. We plotted plasma concentra- 


tions of total Pt, as well as filterable Pt, against time 


to 


visualize the changes in concentration over the 


48-hour period. The serum concentration—time data 
for filterable Pt (the pharmacologically active spe- 
cies) were fitted to a biexponential model with the 
NONMEM program.?! Data obtained during the in- 
fusion period were incorporated in the kinetic analy- 
sis. 


Postmortem OAE Measurements. Afterthe DPOAE 


recordings, the dogs were euthanized by intravenous 
injection of Lethabarb (325 mg/mL of pentobarbital 
sodium, Virbac Australia Pty Ltd, Peakhurst, Aus- 
tralia). We measured DPOAESs to equilevel prima- 
ries 10 minutes after death to ascertain the biologi- 
cal validity of OAE responses. 


The study was conducted in accordance with the 


principles of laboratory animal care as required by 
the University of Queensland Animal Experimenta- 
tion Ethics Committee. 


RESULTS 
Platinum Analysis. The plasma concentration-time 
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profile of free and total Pt in individual dogs over 48 
hours after administration of cisplatin is shown in 
Fig 1. The concentrations of filterable Pt, as well as 
the total plasma Pt, reached a peak at 20 minutes 
after commencement of the cisplatin infusion. Filter- 
able Pt displayed log-linear decay with a rapid ini- 
tial phase (5 dogs; mean half-life, 6 minutes; range, 
2.9 to 7.0 minutes) followed by a slower, second 
phase (4 dogs; mean half-life, 15.8 hours; range, 7.98 
to 19.4 hours). In dog 2, data could only be obtained 
for the initial phase because of a very rapid rate of Pt 
elimination. 

DPOAE Threshold Shifts Following Cisplatin In- 


fusion. The analysis of variance for repeated mea- 


sures among days showed a significant difference (p 
< .05) for DPOAE thresholds at 6 kHz (Table 2 and 
Fig 2). Post hoc, pairwise comparison by the least 
significant difference test revealed a significant dif- 
ference between DPOAE thresholds before infusion 
and those 4 days after infusion (p < .05), and be- 
tween those 2 days after infusion and those 4 days 
after infusion (p < .01; Table 2 and Fig 2). 

Postmortem OAE and ABR Measurements and 
Necropsy. Postmortem TEOAEs and DPOAEs to 
55/55 dB SPL primary tones 10 minutes after death 
were absent. This finding demonstrated that the OAE 
responses recorded in the present study were indeed 
biological in origin and that they reflected the more 
active process of the cochlea. 


DISCUSSION 

In the present study, a single dose of cisplatin, com- 
parable to that used clinically, was used to investi- 
gate the early effects of cisplatin on DPOAE thresh- 
olds in dogs. A significant elevation in DPOAE thresh- 
olds at 6 kHz was found at 4 days after infusion of 
cisplatin. No increase in DPOAE thresholds was 
found 2 days after infusion as compared to the prein- 
fusion baseline thresholds. The DPOAE thresholds 
recorded 4 days after infusion also were significantly 
elevated from those recorded at 2 days after infusion. 


An examination of the raw data revealed that the 
dramatic elevation of DPOAE thresholds recorded 
in both ears of dog 3 and in the left ears of dogs 4 
and 5 at 6 kHz contributed most to the significant dif- 
ference at 4 days after infusion. Interanimal variabil- 
ity in susceptibility to cisplatin-induced ototoxicity 


TABLE 2. DISTORTION PRODUCT OTOACOUSTIC EMISSION THRESHOLDS RECORDED BEFORE INFUSION AND 
TWO AND FOUR DAYS AFTER INFUSION OF CISPLATIN 











F2 n Before Infusion 2 Days After Infusion 4 Days After Infusion F Value p 
2 kHz 8 40.8 + 8.04 41.7 +6.94 38.6 + 5.26 F(2,6) = 0.808 489 
4kHz 10 39.2 + 9.07 37.9 + 11.44 37.9 + 5.00 F(2,8) = 0.139 872 
6kHz 10 26.8 + 3.30 25.6 + 0.60 31.25.13 F(2,8) = 5.010 <.05* 


Data are mean + SD in decibels sound pressure level. 
*Significant difference on analysis of variance for repeated measures. 
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in animal studies has been widely reported.}®!! De- 
Spite initial suggestions that this variability could be 
related to variability in the plasma concentrations of 
filterable Pt between individual animals,!2 the find- 
ings of the present study, as well as that of the recent 
study by Ekborn et al,!3 have shown otherwise. There 
was little variation in the peak plasma concentrations 
of filterable Pt between individual animals, and there- 
fore, the observed elevation of DPOAE thresholds 4 
days after cisplatin infusion could not be attributed 
to differences in peak plasma Pt levels. 


In previously published animal studies, cisplatin ` 


mostly was administered intraperitoneally or subcu- 
taneously, and changes in hearing have been mainly 
measured electrophysiologically by use of ABRs and 
cochlear microphonics. Hitherto, there has been only 
1 study that investigated the ototoxic effects of cis- 
platin on DPOAE thresholds.” The latter study re- 
ported changes in DPOAE thresholds 5 days after 
subcutaneous administration of a high dose (10 mg/ 
kg) of cisplatin to Mongolian gerbils. In the present 
study, however, cisplatin was administered to dogs 
in the same way it is given to humans (a clinical dose 
infused intravenously per unit surface area), and 
DPOAE thresholds were measured in dogs with com- 
mercially available equipment. Furthermore, the pres- 
ent study is the first to investigate the early ototoxic 
effects of cisplatin on DPOAE thresholds in dogs. 
Despite the above-mentioned methodological differ- 
ences between the present study and the study by 
Sie and Norton,?2 the findings of both studies were 
nonetheless similar. An elevation of DPOAE thresh- 





old was observed within 4 days after the administra- 
tion of cisplatin. The present study demonstrated that 
the DPOAE thresholds have the potential to be used 
as an effective tool to investigate early ototoxic ef- 
fects of cisplatin. 


In conclusion, the present study found that DPOAEs 
— in particular, DPOAE thresholds — have the po- 
tential to be used as a clinical measure to detect early 
cisplatin-induced ototoxicity. In addition, the present 
study revealed that the variation in the elevation of 
DPOAE thresholds between animals could not be at- 
tributed to variations in plasma concentrations of fil- 
terable Pt. However, there are limitations to this pre- 
liminary study that need to be addressed in future 
studies. First, a larger number of dogs may be needed 
to provide a more accurate assessment of the interani- 
mal variability in OAE and ABR responses. Second, 
the frequency range of the OAE measurements in the 
present study was limited to 6 kHz. This was a limi- 
tation of the commercially available equipment used 
in the study, and future studies using equipment (be 
it a customized system or a commercially available 
one) that would allow collection of OAE data at fre- 
quencies up to 20 kHz are warranted in order to cap- 
ture any early ototoxic effects of cisplatin at the high- 
er frequencies. Finally, cisplatin is usually adminis- 
tered every 3 or 4 weeks cyclically to dogs in veteri- 
nary oncology clinics,!4 whereas in the present study, 
only the effects of a single of such high doses of cis- 
platin were studied. Future studies may benefit by 
addressing any cumulative effects of repeated clini- 
cal doses of cisplatin on OAEs. 
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The presence of multiple giant cell lesions in the maxillofacial region is suggestive of hyperparathyroidism or neurofibromato- 
sis. A case of synchronous, multiple peripheral giant cell granulomas is reported, bilaterally affecting the mandibular gums and also 
the upper right maxillary gingival mucosa, without concomitant systemic disease. Only 2 cases of central giant cell granulomas, and 
no cases of peripheral giant cell granulomas exhibiting similar characteristics, have been reported in the literature to date. 


KEY WORDS — hyperparathyroidism, multiple peripheral giant cell granulomas, neurofibromatosis. 


INTRODUCTION 


Peripheral giant cell granuloma (PGCG) is a rela- 
tively common lesion of the oral cavity, accounting 
for 0.4% to 1.9% of all oral surgical pathology ma- 
terial, and appears as a localized tumorlike enlarge- 
ment of the gums and alveolar ridge.! Clinically, 
PGCG appears as a small, well-delimited, dark red 
focal mass on the gingiva with a sessile or peduncu- 
lated base that usually originates from either the perio- 
dontal ligament or the mucoperiosteum.23 The un- 
derlying cause of PGCG remains unclear, even though 
the disorder is often associated with local irritation. 
It is generally regarded as a reactive, non-neoplastic 
lesion composed of granuloma-like tissue dominated 
by multinucleated giant cells.4 Unlike so-called cen- 
tral giant cell granuloma (CGCG), PGCG either does 
not affect bone or does so as a consequence of me- 
chanical action upon the underlying bone. Further- 
more, while CGCG typically affects young patients, 
with expansion and perforation of the underlying cor- 
tical bone,> 50% of all PGCGs are found in patients 
between 40 and 59 years of age,® and their principal 
clinical manifestation is the presence of a gingival 
mass. 


The literature presents extensive series of PGCG 
(79 cases in one series? and 720 cases in another’), 
but no multiple and synchronous presentations of the 
disorder have been reported to date. The present study 
describes a case of PGCG in which 3 lesions simul- 
taneously affected the mandible and upper jaw. 


CASE REPORT 
A 58-year-old man presented with a painless tu- 


mor mass that had affected the left mandibular gums 
for the past month. The patient was edentulous and 
wore dentures. Oral examination revealed the pres- 
ence of 2 independent tumor masses affecting the 
right and left mandibular gums and measuring ap- 
proximately 1.5 and 2.5 cm in greatest diameter, re- 
spectively (Fig 1). A third lesion of similar character- 
istics was also identified in the vestibular gingival 
mucosa of the right upper jaw, with a greatest diam- 
eter of 1.5 cm. All 3 lesions were sessile, well delim- 
ited, non-ulcerated, painless, non-hemorrhagic, and 
dark red in color. No cervical adenopathies were ob- 
served. Orthopantomography revealed osteolysis of 
apparently mechanical origin underlying the mandib- 
ular lesions. Computed tomography in turn revealed 
the presence of 3 osteolytic zones affecting the ex- 
ternal cortical layer of the left parasymphyseal re- 
gion, the right mandibular body, and the antral zone 
of the right upper jaw (Figs 2 and 3). 


The findings of a histopathologic study of an inci- 
sion biopsy of the left parasymphyseal tumor were 
compatible with reparative giant cell granuloma. Lab- 
oratory tests performed to evaluate clinically sus- 
pected parathyroidism revealed levels of c-parathy- 
roid hormone (PTHc) of 30 pg/mL (reference range, 
10 to 65 pg/mL), 25-hydroxycholecalciferol of 20 
ng/mL (reference range, 10 to 60 ng/mL), 1-25-dihy- 
droxycholecalciferol of 34 pg/mL (reference range, 
10 to 60 pg/mL), calcemia of 9.79 mg/dL, phospho- 
rus of 2.8 mg/dL, alkaline phosphatase of 184 U/L 
(reference range, 98 to 280 U/L), and fractionated 
alkaline phosphatase of 69 U/L. Hyperparathyroid- 
ism was therefore discarded as a diagnosis. The 3 
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Fig 1. Oral exploration of mandibular gingival lesions. 


lesions were subjected to subperiosteal exeresis and 
curettage of the bone bed under general anesthesia. 
The histologic study of the resection pieces con- 
firmed the biopsy findings, ie, PGCG (Fig 4). No re- 
currences have been observed after 6 months of fol- 
low-up. 
DISCUSSION 

Cherubism, hyperparathyroidism, fibrous dyspla- 
sia, aneurysmal bony cyst, ossifying fibroma, neuro- 
fibromatosis, chondroblastoma, osteosarcoma, and 
some cases of histiocytosis X can present giant cells 
— although in the maxillofacial region, CGCG and 
PGCG are the most prevalent lesions possessing such 
histologic features. Although the literature describes 
5 cases of multiple CGCGs in the absence of hyper- 
parathyroidism (2 of them associated with neurofi- 
bromatosis type I),8° a review of the published se- 
ries revealed no multiple and synchronous presenta- 
tions of PGCG aside from this disease. Consequently, 
the suspected diagnosis after the biopsy in our case 
was brown tumor secondary to hyperparathyroidism. 
In general, fewer than 10% of cases of hyperparathy- 
roidism show signs of giant cell lesions of the jaws 





Fig 2. Computed axial tomography scan of mandible 
shows areas of mechanical osteolysis produced by lesions. 





Fig 3. Computed axial tomography scan of upper jaw 

shows upper maxillary lesion and external cortical osteo- 

lysis in molar region. 
(central or peripheral).!9 Primary hyperparathyroid- 
ism, which occurs mainly in middle-aged women 
(75% of cases), is due to excessive secretion of para- 
thyroid hormone (PTH) from an adenoma (85%), car- 
cinoma (5%), or idiopathic hyperplasia (10%) of the 
parathyroid glands.'' Brown tumor lesions are rarely 
seen in primary hyperparathyroidism; the patients 
present elevated blood PTH and calcium levels. On 
the other hand, it is well known that chronic renal 
failure can cause renal osteodystrophy due to second- 
ary hyperparathyroidism. It is reported that the inci- 
dence of hyperparathyroidism in patients with chron- 
ic renal failure ranges from 18% after | year on di- 
alysis to 92% after more than 2 years.!? In these cases, 
blood PTH levels are elevated, although the calcium 
concentrations tend to be low.!3 In our patient, the 
laboratory tests (including PTHc) yielded values in 
the normal range; this fact, associated with the lack 
of other clinical manifestations (kidney stones, hy- 





Fig 4. Histopathologic findings. A) Moderately hyper- 
plastic mucosa (H & E, original x40). Underlying tissue 
with abundant siderophages. Irregular granulomatous for- 
mations with numerous giant multinuclear cells delim- 
ited by bands of connective tissue. B) Granulomatous 
lesion infiltrating bone (H & E, original x250). 








Junquera et al, Peripheral Giant Cell Granulomas of Gums 753 


pertension, neuromuscular and gastrointestinal dis- 
orders) and the absence of a history of kidney fail- 
ure, ruled out primary or secondary hyperparathy- 
roidism. 

Neurofibromatosis type I is one of the most com- 
mon autosomal dominant diseases, with an estimated 
frequency of 1 in every 2,500 to 3,000 births.!4 The 
skeletal system is often affected, and bony abnor- 
malities are present in 50% to 70% of patients with 
the condition.’ In our case, the possibility of neuro- 
fibromatosis was discarded, for despite the age of 
the patient, there was no prior diagnosis or clinical 
manifestations of the disease. 


Peripheral giant cell granuloma particularly affects 
the mandibular gums in the premolar and molar re- 
gions. Most cases published in the literature occurred 
in individuals in the 40- to 60-year age interval (range, 
8 to 85 years), with a predominance of women.’ Bod- 
ner et al? reported a male-female ratio of 1 to 2.75 
for tumors over 2 cm in size, and a male-female ra- 
tio of 1 to 1.5 for smaller lesions. 


The pathogenesis of PGCG remains uncertain, al- 
though the participation of local stimuli is generally 
accepted! (ie, tooth extractions, ill-fitting dentures, 
overhanging restorations, periodontitis, periodontal 
surgery, and deficient oral hygiene). Hypoptyalism 
has also been described as a triggering factor, in that 
the oral mucosa loses an important protective ele- 


ment under such conditions.* Our patient had had 
his last teeth removed 3 years earlier, and the com- 
plete dentures were ill-fitting — although we cannot 
categorically affirm that these factors were implicated 
in the development of the lesions. Mighell et al!® re- 
ported that none of their cases of PGCG in edentu- 
lous gaps were related to recent tooth extractions. 


Peripheral giant cell granuloma is more common 
than CGCG, at a ratio of 3:1.!5 Local bone resorp- 
tion has been observed in 28% of cases of PGCG.!7 


Surgical resection of the lesions is the treatment 
of choice, with recurrence rates of between 5% and 
70.6%, according to the literature.!8:!9 This wide vari- 
ation is probably attributable to the surgical techniques 
used, since recurrences excised to the periosteum did 
not occur again.!> It has been reported that the risk 
of recurrence increases in the presence of hyperpara- 
thyroidism.!3 

CONCLUSION 

The presence of multiple giant cell lesions in the 
maxillofacial region is suggestive of hyperparathy- 
roidism or neurofibromatosis. The present study doc- 
uments a case of multiple, synchronous PGCGs af- 
fecting the mandibular gums bilaterally and also the 
upper right maxillary gingival mucosa— with no con- 
comitant systemic disease. Only 2 cases of CGCG, 
and no cases of PGCG, exhibiting similar character- 
istics have been reported in the literature to date. 
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Although granular cell tumors are relatively common in the head and neck, symptomatic granular cell tumors of the neurohypo- 
physis are extremely rare. Ophthalmologic symptoms are most common, followed by endocrinologic manifestations. We report a 
case of a granular cell tumor of the pituitary fossa that was surgically treated. The clinical manifestations, radiographic appearance, 
and surgical management of granular cell tumors of the pituitary fossa are reviewed, as well as the unique histopathology and 


electron microscopy of this uncommon neoplasm. 


KEY WORDS — granular cell tumor, neurohypophysis, pituitary neoplasm. 


INTRODUCTION 


Granular cell tumors (GCTs) may occur in virtu- 
ally any anatomic site, although they have a predi- 
lection for the head and neck region. Subcutaneous 
nodules and oral cavity lesions account for about one 
third of all GCTs.! Large, symptomatic GCTs of the 
neurohypophysis are extremely rare, with fewer than 
40 cases reported in the literature.>* These benign 
tumors are usually small and asymptomatic and have 
been found fairly commonly as incidental findings 
in autopsies.? Since the initial description by Boyce 
and Beadles!° in 1893, the histiogenesis of GCTs has 
been greatly debated. Synonymous terms such as pi- 
tuicytoma, myoblastoma, infundibuloma, and choris- 
toma have been used. 


We report a case of a GCT of the pituitary fossa 
that was surgically treated by our departments. We 
present the radiographic appearance, endoscopic re- 
section, and immunohistochemical and electron mi- 
croscopic results. 


CASE REPORT 


A 65-year-old man presented with a 3-day history 
of progressive occipital headache and dizziness. His 
past medical history was significant for hypertension 
and a left cerebrovascular accident (CVA) in 1984. 
The patient denied nausea, vomiting, visual distur- 
bances, or neck stiffness. A review of systems was 
noncontributory. Besides the baseline mild right- 
sided paresis from the prior CVA, the remainder of 


the physical examination yielded normal findings. 


In light of the past history of CVA, a computed 
tomography scan of the brain was ordered. A noncon- 
trast scan revealed a noncalcified, smooth, well-de- 
fined, hyperdense mass in the pituitary fossa. A mag- 
netic resonance imaging (MRI) scan demonstrated a 
space-occupying lesion within the sella turcica that 
could not be differentiated from the pituitary stalk 
and was abutting but not compressing the optic chi- 
asm. On T1-weighted images, the tumor was isoin- 
tense with the brain parenchyma and showed nonho- 
mogeneous contrast enhancement with some areas 
of questionable necrosis (Fig 1). T2-weighted im- 
ages revealed a heterogeneous signal reduction of 
the tumor. A magnetic resonance angiogram revealed 
no tumor blush. Endocrinologic and ophthalmologic 
consultations were obtained, both of which revealed 
no abnormalities. 


The presumptive diagnosis was a pituitary adeno- 
ma. The patient underwent an endoscopic transsphe- 
noidal approach for resection of the tumor. Upon re- 
section of the posterior sphenoidal wall, a firm, tan- 
colored, highly vascular mass was encountered. Be- 
cause of the atypical consistency of the mass, a sam- 
ple was sent for frozen section analysis. Histologic 
analysis revealed a probable diagnosis of GCT. The 
tumor was 90% debulked, and specimens were sent 
for immunohistochemical staining. In addition, a 1- 
mm cube of tumor was cut with a scalpel on a sterile 
tongue blade and immediately immersed in cold Kar- 
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Fig 1. Tl-weighted magnetic resonance im- 
ages. A) Sagittal image demonstrates granular 
cell tumor (T), infundibulum (arrow), and sphe- 
noid sinus (S). B) Coronal image with gado- 
linium contrast demonstrates heterogeneous en- 
hancement of tumor (T). Note relationships to 
optic chiasm (arrow), sphenoid sinus (S), and 
internal carotid artery (ICA). 


novsky’s fixative for ultrastructural study. The sphe- 
noid sinus was packed with tensor fascia lata, fat, 
and fibrin glue. The patient had an uneventful post- 
operative course and has remained asymptomatic for 
several years since the operation. 


HISTOPATHOLOGIC FINDINGS 

The surgically resected specimen was fixed in 10% 
neutral buffered formalin overnight, embedded in 
paraffin, cut in 4-um-thick sections, and stained with 
hematoxylin and eosin and periodic acid—Schiff (PAS) 
stain. Sections revealed a tumor growing in sheets 
and circumscribed columns of plump cells with 
brightly eosinophilic, coarsely granular cytoplasm 
and eccentric, often hyperchromatic nuclei. The cy- 
toplasmic contents were brilliantly PAS-positive. 
Tumor cells were admixed with occasional lympho- 
cytic infiltrate (Fig 2A). 


Immunohistochemical analyses were performed 
on paraffin-embedded tissue to identify 4 antibod- 
ies: mouse anti-human epithelial membrane (EMA; 
1:200 dilution, clone E29, Dako), rabbit anti-cow S- 
100 (1:200 dilution, Dako), low—molecular weight 
keratin cocktail (AE-1; undiluted, Signet Laborato- 
ries), and mouse anti-human CD68 (KP-1; 1:20 di- 
lution, Dako). Immunohistochemical analysis was 
performed by a routine avidin-biotin technique with 
diaminobenzidine as chromogen, and the sections 
were counterstained with hematoxylin. The tumor 
cytoplasm stained diffusely for S-100 and failed to 
stain for AE-1, EMA, or KP-1. 


ULTRASTRUCTURAL FINDINGS 


The glutaraldehyde-fixed specimen was processed 








for examination with a JEOL 100CX electron micro- 


scope. In brief, tissue was osmicated, dehydrated, and 
embedded in EM-bed 812 (EPON 812 substitute, 
Electron Microscopy Sciences, Fort Washington, 
Pennsylvania). One-micron-thick sections were cut 
and stained with toluidine blue to select regions for 
thin sectioning. Ultrathin sections were cut, stained 
with uranyl acetate and lead citrate, and examined. 
Sections revealed regions of densely packed tumor 
cells, despite suboptimal structural maintenance. Tu- 
mor cells contained round nuclei, often in an eccen 
tric position. Tumor cytoplasm contained multiple 
lysosomes of various shapes and sizes and occasional 
mitochondria (Fig 2B). 


DISCUSSION 

The cellular origin of neurohypophyseal GCTs 
remains controversial. The numerous names given 
to the tumor reflect this uncertainty. The term myo- 
blastoma reflects the histologic similarities between 
GCTs and muscle cells.'! More recent studies, how 
ever, have pointed toward a Schwann cell origin for 
these lesions.!? This conclusion is based on the ul 
trastructural observation of a prominent basement 
membrane surrounding each granular cell, lack of 
typical cytochemical findings akin to skeletal muscle, 
and immunoreactivity for S-100 and neuron-specific 
enolase. !2-!4 


Neurohypophyseal GCTs may represent a distinct 
subtype of GCTs, despite similarities to those in other 
sites. Whereas extracranial GCTs are usually immu- 
noreactive to S-100, neuron-specific enolase, myelin 
basic protein, and vimentin, neurohypophyseal GCTs 
have been shown to be immunoreactive only for S- 
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100 and occasionally for macrophage-histiocyte 
markers.'> The cellular origin of neurohypophyseal 
GCTs remains elusive. 


As in our case, the definitive diagnosis is usually 
made histologically from a surgical specimen. The 
difficulty in making the correct diagnosis is related 
to the nonspecific clinical symptoms and radiograph- 
ic findings. Often these tumors are misdiagnosed be- 
fore operation as craniopharyngiomas, meningiomas, 
microadenomas, or pilocytic astrocytomas. 


Histopathologic examination of our case revealed 
typical features of aGCT, including pyramidal to spin- 
dle-shaped cells with eccentric small nuclei and abun- 
dant PAS-positive cytoplasmic granules, a distinctive 
feature of these tumors. The tumor cytoplasm was 
immunoreactive for antibodies against S-100, as typi- 
cally described in GCTs of the neurohypophysis. 


Ultrastructural examination of our tumor revealed 





Fig 2. Findings on microscopy. A) Light mi- 
croscopic examination of tumor reveals cells 
with ample, delicate granular cytoplasm and 
eccentric nuclei (black arrow; H & E, original 
x900). Scattered throughout tumor are occa- 
sional mature lymphocytes (open arrow). B) 
Ultrastructural examination of tumor cells by 
electron microscopy reveals eccentric nuclei 
and ample cytoplasm filled with lysosomal 
structures of various sizes (asterisks; original 
x11,620). 


classic eccentric nuclei and abundant cytoplasm filled 
with lysosomelike structures of various sizes. The 
granularity seen on light microscopy is due to this 
dense population of cytoplasmic lysosomes. Distinct 
basement membrane surrounding each tumor cell 
could not be discerned because of suboptimal tissue 
preservation. 


Other lesions that may have a similar histologic 
appearance include pilocytic astrocytoma, pituitary 
adenoma, and histiocytic masses. However, this tu- 
mor did not contain any Rosenthal fibers or eosino- 
philic granular bodies, as are commonly seen in pilo- 
cytic astrocytomas. No glandlike structures or granu- 
lomas were seen, and the tumor failed to stain with 
immunohistochemical stains for EMA, AE-1, or KP- 
i; 


Large, symptomatic GCTs of the pituitary fossa are 
extremely uncommon, although granular cell nests 
are relatively common. !° Liwnicz et al,° in their liter- 


a ~~ 
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ature review of 30 cases to 1984, described some of 
the clinical features of this tumor. The majority of 
patients were more than 40 years of age at the time 
of diagnosis (range, 26 to 78 years), with a 3:2 fe- 
male-to-male ratio. Lafitte et al,4 in their review of 
the literature, found that ophthalmological symptoms 
such as loss of visual acuity, visual field defects, and 
optic atrophy occurred in approximately two thirds 
of the reported cases. In addition, endocrinologic 
manifestations — which included amenorrhea, steril- 
ity, impotence, decreased body hair, diabetes insipi- 
dus, and weight gain or loss — were seen in only 
half of the cases. Persistent headache, memory loss, 
and behavior disorders were more rarely seen. 


Radiographic descriptions in the literature further 
attest to the nonspecific nature of these tumors. Plain 
films may reveal an expansile sellar mass eroding 
bone without calcifications.!*!8 Computed tomo- 
graphy scans of GCTs usually show a well-demar- 
cated noncalcified tumor that is denser than the cere- 
bral parenchyma and enhances intensely with intra- 
venous contrast material.2> Magnetic resonance im- 
aging findings have been recently described by sev- 
eral authors.*>:7,8 The tumor is generally isointense 
on T1- and T2-weighted images, with nonhomogene- 
ous enhancement after administration of gadolinium— 
diethylenetriamine pentaacetic acid. The heterogene- 
ous enhancement has been attributed to the richly vas- 
cularized pseudocapsule. Unfortunately, these MRI 
findings are nonspecific; similar characteristics are 
found in cases of meningioma or chromophobic ade- 
noma. Magnetic resonance angiography (normal in 
our case) and cerebral angiography (not performed 
in our case) have yielded variable findings in terms 
of neovascularization of the mass.2+17.19 


The location and size of the tumor dictate the ap- 
proach to the tumor. Transcranial and transsphenoidal 
surgical approaches have been described.? In either 
approach, profuse bleeding is frequently encountered, 
and the tumors are often quite firm and difficult to 
remove, as was found in the present case. Decom- 
pression with partial excision is most commonly de- 
scribed, with complete excision via a craniotomy less 





often achieved.?+7,!17-19 We selected an endoscopic 
transsphenoidal approach for this case. Although the 
tumor was adequately debulked, the excessive vas- 
cularity and unclear diagnosis hindered complete ex- 
cision. 


Although the tumor is slow-growing and benign, 
early detection can prevent symptomatic complica- 
tions as the tumor enlarges. Compression of the op- 
tic nerve, pituitary gland, hypothalamus, and third 
ventricle are possible with unchecked growth. Infor- 
mation on alternative or adjunctive therapy is scant. 
Becker and Wilson? reviewed 20 treated cases of 
GCT with and without radiotherapy as an adjunctive 
or primary mode of therapy. This study concluded 
that radiotherapy was of little or no benefit and did 
not alter the short- or long-term prognosis — a con- 
clusion corroborated by other case reports.!7-19 


Malignant GCTs are exceedingly rare. Kershisnik 
et al! reviewed more than 30 cases of extracranial 
malignant GCTs in the literature. Six of the 33 re- 
ported malignant GCTs were located in the head and 
neck region. However, there are no reports of a ma- 
lignant GCT arising from the neurohypophysis. 


Relapses, although infrequent, have been attrib- 
uted to incomplete excisions, usually manifesting sev- 
eral years later.24-6 Complete surgical excision ap- 
pears to be the treatment of choice. However, in view 
of the location, high vascularity, and benign, slow- 
growing nature of the tumor, a partial removal that 
adequately decompresses the optic chiasm is usually 
all that is recommended.6 Yearly surveillance MRI 
scans should be obtained to check for recurrence of 
the tumor. Symptomatic recurrences are amenable 
to repeat surgical decompression. 


In summary, although GCTs are relatively com- 
mon in the head and neck region, symptomatic GCTs 
of the pituitary fossa are a rare, yet classic pathologi- 
cal entity. Otolaryngologists operating in the sellar 
region should be familiar with this tumor and include 
it in the differential diagnosis of neurohypophyseal 
neoplasms. 
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INTRODUCTION 


Chronic maxillary atelectasis (CMA) is character- 
ized by a persistent decrease in maxillary sinus vol- 
ume as a result of inward bowing of one or more 
antral walls.! This condition is believed to result from 
a flaccid medial infundibular wall (MIW) that causes 
flap valve occlusion of the natural ostium. Findings 
in patients with CMA include 1) fontanelle retrac- 
tion and lateral displacement of the MIW on rhinos- 
copy, 2) sinus opacification on computed tomogra- 
phy, and 3) negative antral pressure on manometry.” 

Patients with stage I and II CMA often present 
with symptoms of sinusitis, but patients with stage 


III disease may be asymptomatic. This absence of 


significant symptoms presumably allows the condi- 
tion to go undetected until the patient presents in an 
already advanced stage with enophthalmos, hypoglo- 
bus, and/or malar depression deformity. The nega- 
tive pressure characteristically found within the af- 


fected sinus is believed to contribute to bone remod- 
eling and subsequent progressive facial deformities.’ 
What effects, if any, restoration of antral ventilation 
has on this process in adults is not known. We present 
an adult with confirmed stage III CMA, provide pho- 
tographic evidence of progressive hypoglobus, and 
objectively compare preoperative and postoperative 
computed tomographic (CT) scans for changes in the 
bony walls of the affected sinus. 


CLINICOPATHOLOGICAL FINDINGS 


A 32-year-old woman was in her usual state of 
health until 3 months before her initial consultation, 
when friends noticed that her left eye was in a lower 
position than her right eye. The patient recalled ex- 
periencing bilateral intermittent facial discomfort for 
many years, but the episodes were rarely associated 
with congestion or nasal drainage. Over a 3-month 
period, she noticed a drop in the position of her left 
eye, along with a pulling sensation just above the eye. 





Fig 1. Coronal computed tomographic scans performed before and after middle meatal antrostomy (MMA) show A) near-total 
antral opacification, fontanelle retraction (arrowhead), and laterally malpositioned medial infundibular wall before operation, 
versus B) well-aerated left antrum and widely patent left MMA (arrowhead) after operation. Arrows — left orbital floor 
deformity. 
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Fig 2. Axial computed tomographic scans performed before and after MMA show A) near-total antral opacification and 
inward bowing of posterolateral wall (arrow) before operation, versus B) well-aerated left antrum with widely patent MMA 
(arrowhead) and outward bowing of posterolateral antral wall (arrow) after operation. 


She also reported diplopia on upward gaze. A physi- 
cal examination was notable for 4 mm of left hypo- 
globus without malar depression deformity. Extra- 





Fig 3. Photographs taken over 8-year period before di- 
agnosis. All photographs were digitally scanned, dimen- 
sionally normalized, and oriented to bring line drawn 
between medial and lateral canthi into horizontal posi- 
tion. Horizontal line was drawn from pupillary light re- 
flex of unaffected right eye to left eye to depict progres- 
sion of hypoglobus. Time periods before diagnosis were 
A) 8 years, B) 2 years, C) 1 year, and D) 1.5 months. 


ocular movement was intact, with diplopia on up- 
ward gaze. Endoscopy showed a widened middle me- 
atus owing to a lateralized MIW and fontanelle. A 
CT scan demonstrated stage HI CMA (Figs 1A and 
2A), and a left endoscopic middle meatal antrostomy 
(MMA) was performed. Intraoperative findings in- 
cluded complete occlusion of the left maxillary si- 
nus ostium by the lateralized MIW. The medial orbi- 
tal wall was well visualized superiorly, above the re- 
tracted fontanelle. A pressure of —4 cm H20 was 
found within the left antrum. The ostium was revealed 
by peeling the MIW off the medial wall of the an- 
trum with an ostium seeker. Inspissated mucus was 
suctioned and an MMA was created. The patient 
elected to forgo orbital reconstruction and did well 
after the operation. A patent middle MMA was ob- 
served at follow-up, and no change in hypoglobus 
was found according to subsequent eye measure- 
ments. 


PHOTOGRAPHIC FINDINGS 


Photographs taken over an 8-year period before 
diagnosis were obtained with the patient’s consent 
and retrospectively reviewed. All photographs were 
digitally scanned and images were rotated to bring 
an arbitrary straight line drawn through the inner and 
outer canthi of each orbit to a horizontal plane. A 
horizontal line was then drawn through the pupil- 
lary light reflex of the unaffected eye to the opposite 
eye. We were then able to pictorially depict the pro- 
gressive nature of the hypoglobus by this horizontal 
line and its relationship to the affected eye’s pupil- 
lary light reflex (Fig 3). 


RADIOLOGIC FINDINGS 
Preoperative CT scans in the axial and coronal 
planes showed fontanelle retraction, a laterally mal- 
positioned MIW, inferior depression of the left orbi- 
tal floor, and left maxillary sinus opacification with 
inward bowing of the posterolateral antral wall (Figs 
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DIMENSIONAL MEASUREMENTS OBTAINED AT FIVE 
TIME POINTS OVER 40-MONTH PERIOD 











Maxillary Sinus Maxillary Sinus 
Volume (mL) Surface Area (cm?) 
Age (y) Right Left Right Left 





33.3 20.68 +0.14 13.99 +0.06 44.68 +0.04 39.80 +0.18 
33.6 20.94 +1.00 14.21 +0.05 45.90 +1.40 40.77 +0.10 
33.9 19.93 +0.10 16.23 +0.11 44.96 +0.04 41.39 0.35 
35.5 21.0040.05 16.84 +0.04 42.62 +0.12 41.11 +0.01 
36.5 20.63 +0.17 16.87 £0.03 


Data are mean + SD. 


1A and 2A). A CT scan performed 8 months after 
operation showed a patent MMA, inferior depres- 
sion of the left orbital floor, and a well-aerated left 
antrum with outward bowing of the posterolateral 
wall (Figs 1B and 2B). Antral volume and surface 
area measurements were taken from 5 separate coro- 
nal CT sections taken before and after operation over 
a 40-month period. The scans were traced and regis- 
tration points were digitized manually with an elec- 
tronic data tablet. Standard computerized planimet- 
ric equations were used to determine the area of each 
region in each section. The volumes were then calcu- 
lated as the sum of 2- and 5-mm-thick sections with 
areas equal to the region areas. This “stacked coin” 
volume approximation is accurate to within 10% of 
the actual volume.* 


The normal-appearing right sinus showed virtual- 
ly no change in volume over the 40-month time pe- 
riod (see Table). The volume of the atelectatic left 
antrum increased significantly, from 13.99 mL be- 
fore operation to 16.87 mL after operation. Seventy- 
eight percent of this volumetric increase occurred 
within the first 8 months after surgery (Fig 4). This 
growth coincided with radiographic changes seen in 
the posterolateral antral wall over this same time in- 
terval. 





DISCUSSION 


Predisposition to the development of CMA is 
thought to be a result of a deficiency in the posterior 
attachment of the uncinate process to the inferior tur- 
binate bone. Flap valvular occlusion of the natural 
ostium may be precipitated by mucosal edema in- 
volving the middle meatal structures.) Asymptom- 
atic patients with so-called silent sinus syndrome are 
more likely to go untreated until there is progression 
to late-stage disease.5-? The development of stage III 
disease over an 8-year period in our patient suggests 
chronic antral obstruction with consequent osseous 
remodeling.? However, the impact of such factors as 
negative pressure, sinus inflammation, and bony wall 
thickness on progression to stage III disease remains 
unknown.! The exact time frame for this process to 
occur is also unknown, but we have presented evi- 
dence that hypoglobus can be progressive over 
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Fig 4. Graph of maxillary sinus volume as function of f 
age. 


months to years. 


Most of the increase in sinus volume seen in the 
patient presented in this report correlates with the 
reversal of the internal convexity of the posterolateral 
antral wall. This wall and the superior wall have the 
thinnest bone and, not surprisingly, show the most 
impressive antral deformities.8 The left antrum at- 
tained only 82% of the total volume of the right after 
operation. The diminished postoperative volume can 
be accounted for by radiographic evidence of persis- 
tent orbital sagging. On the basis of our observations 
from this case, the effects of bone remodeling on an- 
tral dimensions and contours after the reversal of neg- 
ative pressure appear to occur early (within the first 
6 to 12 months). 


Endoscopic MMA is the standard of care after an 
initial course of medical therapy has failed. How- 
ever, in late-stage disease, an otolaryngologist should 
recognize the disturbance in orbital anatomy to avoid 
inadvertent orbital entry.? Reconstruction of the orbi- 
tal floor was an option for the patient presented, but 
she elected not to undergo this procedure. Approaches 
to orbital reconstruction include transconjunctival and 
subciliary orbitotomy. The orbital floor is built up 
by layering alloplastic (titanium micromesh or po- 
rous polyethylene sheets) or autogenous implants in- 
ferior to the orbital periosteum. Special care is recom- 
mended to avoid exposing the implants through the 
antral mucosa during placement, to limit the chance 
for infection or extrusion.? 


Late-stage disease as described in this adult case 
of CMA should not be generalized to children, be- 
cause there may be other anatomic deformities, such 
as maxillary sinus hypoplasia, affecting the disease 
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manifestations in pediatric CMA. Children most of- 
ten have stage I of the disease, whereas adults have 
been seen with the full spectrum of deformities and 
stages.’ 


Our findings show that restoration of antral venti- 
lation in a patient with stage I CMA does not result 


in clinically significant reversal of bony deformities 
involving the orbital floor, although such changes 
were seen in the posterolateral wall. We suggest that 
surgeons give serious consideration to orbital recon- 
struction at the time of MMA in patients with sig- 
nificant ocular manifestations of stage HI CMA who 
desire corrective oculoplastic surgery. 


REFERENCES 


I. Kass ES, Salman S, Rubin PAD, Weber AL, Montgomery 
WW. Chronic maxillary atelectasis. Am Otol Rhinol Laryngol 
1997:106:109-16. 

2. Kass ES, Salman S, Montgomery WW. Manometric study 
of complete ostial occlusion in chronic maxillary atelectasis. 
Laryngoscope 1996;106:1255-8. 

3. Kass ES, Goodman ML, Salman S. Surgical management 
of chronic maxillary atelectasis. Operative Techn Otolaryngol 
Head Neck Surg 2001;12:28-33. 

4. Huang GT, Rosowski JJ, Flandermeyer DT, Lynch TJ II], 
Peake WT. The middle ear of a lion: comparison of structure 
and function to domestic cat. J Acoust Soc Am 1997;101:1532- 
49, 

5. Soparkar CN, Patrinely JR, Cuaycong MJ, et al. The si- 


lent sinus syndrome. A cause of spontaneous enophthalmos. 
Ophthalmology 1994;101:772-8. 


6. Dailey RA, Cohen JI. Surgical repair of the silent sinus 
syndrome. Ophthal Plast Reconstr Surg 1995;11:261-8. 


7. Rose TP. Spontaneous enophthalmos associated with 
asymptomatic maxillary sinus disease (silent sinus syndrome): 
case report. J Am Optom Assoc 1998:69:236-40. 


8. Kass ES. The diagnosis and treatment of chronic maxil- 
lary atelectasis in adults and children. Curr Opin Otolaryngol 
Head Neck Surg 1999;7:39-43. 


9. Kass ES, Salman S. Developmental changes in the an- 
trum of a child with chronic maxillary atelectasis. Ann Otol 
Rhinol Laryngol 1999;108:1091-4. 


This publication is available 
in microform. 


UMI reproduces this publication in microform: micro- 
fiche and 16 or 35mm microfilm. For information about 
this publication or any of the more than 16,000 peri- 
odicals and 7,000 newspapers we offer, complete 
and mail this coupon to UMI, 300 North Zeeb Road, 
Ann Arbor, MI 48106 USA. Or call us toll-free for an im- 
mediate response: 800-521-0600. From Alaska and 
Michigan call collect 313-761-4700. From Canada 
call toll-free 800-343-5299. Name 
UMI ln 

Company/Institution 


A Bell & Howell Company Address 

300 North Zeeb Road : : 
Ann Arbor, MI 48106 USA City/State/Zip 
800-521-0600 toll-free Phone ( 
313-761-4700 collect from Alaska and Michigan 

800-343-5299 toll-free from Canada 



































9a 


CLINICAL FELLOWSHIP 
IN OTOLOGY 
AND NEUROTOLOGY 


MASSACHUSETTS EYE AND 
EAR INFIRMARY 


Fellowship training is offered for individuals 
who aspire to a full-time academic career. This 
fellowship will provide opportunities for clini- 
cal research. Two-year fellowships are avail- 
able every other year beginning in 2004. Board 


eligibility or certification is required. 


For more information, contact: 


Michael J. McKenna, MD 
Dept of Otolaryngology 
Massachusetts Eye and Ear Infirmary 
243 Charles Street 
Boston, MA 02114 
(617) 573-3654 


PEDIATRIC OTOLARYNGOLOGY 
RESIDENCY PROGRAM 


Fellowship 


July 1, 2004 - June 30, 2006 


University Health Center of 
Pittsburgh Program 
Children’s Hospital of Pittsburgh 
Pittsburgh, PA 


Accredited by the Residency Review 
Committee Accreditation Council for 
Graduate Medical Education 
Direct inquiries to: 
Margaretha L. Casselbrant, MD, PhD 
Department of Pediatric Otolaryngology 
Children’s Hospital of Pittsburgh 
3705 Fifth Avenue 
Pittsburgh, PA 15213 
(412) 692-8577 
e-mail: pamela.wilson@chp.edu. 

An Equal Opportunity/Affirmative Action Employer. 


Qualified minority and female applicants are encouraged to apply. 
A166 








INDEX TO ADVERTISERS 





Annals Publishing Company 7a, cover 3 


Anthony Products, Inc 


cover 4 


Children’s Hospital of Pittsburgh 


Endocraft LLC 


Gruppo Otologico 


Jedmed Instrument Company 


Massachusetts Eye & Ear Infirmary 10a, cover 3 


Mount Sinai School of Medicine cover 3 


Santa Barbara Medco, Inc cover 2 


University Medical Center Nijmegen 


University Microfilms International 


. 


Every precaution is taken to ensure accuracy, but we 
cannot guarantee against the possibility of an occasional 
change or omission in the preparation of this index. 











FELLOWSHIP IN 
MICROVASCULAR 
RECONSTRUCTION OF 
THE HEAD AND NECK 


MOUNT SINAI SCHOOL OF 
MEDICINE 


NEW YORK, NEW YORK 


The Mount Sinai School of Medicine, Department of Oto- 
laryngology—Head and Neck Surgery, is offering a one- 
year, hands-on fellowship in microvascular reconstruction 
of the head and neck to begin July 1, 2003. Under the 
direction of Mark L. Urken and Eric M. Genden, fellows 
will be exposed to a broad range of reconstructive tech- 
niques with a significant emphasis on microvascular free 
tissue transfer. Candidates should be able to obtain a li- 
cense to practice medicine in the State of New York. 
Fellows are responsible for coordinating the weekly head 
and neck tumor board, as well as playing an active role in 
resident education. Applicants should have a strong back- 
ground in head and neck oncological surgery, as well as a 
commitment to pursuing an academic career, 
If interested, please submit a current curriculum vitae 
and two letters of recommendation to: 
Mark L. Urken, MD, FACS 
Professor and Chairman 
Department of Otolaryngology—Head and Neck Surgery 
Mount Sinai School of Medicine 
One Gustave L. Levy Place, Box 1189 
New York, NY 10029-6574 


FELLOWSHIP IN LARYNGOLOGY, 
PHONOSURGERY AND VOICE 
DISORDERS 


HARVARD 
MEDICAL SCHOOL 


Available July 1. 2004. The Department of Otology and 
Laryngology is seeking applicants for a one-year clinical 
fellowship in laryngology, phonosurgery, and management of 
the professional voice. The curriculum will provide a wide 
range of experiences, including phonomicrosurgery (cold 
instruments and lasers), laryngeal framework surgery, laryn- 
geal electromyography, and the use of botulinum toxin injec- 
tions for spasmodic dysphonia. The fellow will participate in 
the management of voice disorders as a member of a 
multidisciplinary team that includes full exposure to state-of 

the-art voice laboratory facilities. The curriculum is designed 
so that upon completion, the fellow will have sufficient 
background to establish an independent voice disorders pro- 
gram at his or her home institution. There are numerous 
clinical and basic science research opportunities available as 
well as the option to work collaboratively with local music 
conservatories. Certification by the American Board of Oto- 
laryngology or eligibility is a prerequisite. 

Direct inquiries to: Steven M. Zeitels. MD, Director. Division 
of Laryngology, Massachusetts Eye and Ear Infirmary, 
243 Charles Street, Boston, MA 02114: 

Phone: (617) 573-3557, Fax: (617) 573-3068 





ANNALS OF OTOLOGY, RHINOLOGY & LARYNGOLOGY 


ANNOUNCEMENT RATES 


Quarter page 
Half page 
Full page 


Material must be received 30 days prior to the month of issue; eg, for the February issue the due date is 


January 1. 


CHANGE OF ADDRESS 


Address changes must be received 45 days prior to the effective date so that your subscription will be 


received without interruption. 


CLAIMS 


Claims for missing issues must be received within 60 days of the month of publication for the United States 


and Canada, and 180 days for other countries. 


Annals Publishing Company 
4507 Laclede Avenue 
St Louis, MO 63108 
(314) 367-4987 
Fax: (314) 367-4988 
E-mail: manager@ Annals.com 


$220 
370 
645 























ë 
Visit our Web site at 
www.aveloxusa.com 


Avelo: i 
moxifloxacin HC! Tablêts 


F 4| 


| Æ | : 


© 2002, Bayer Corporation AV-0401 





